~10x). Also, some tumor lines have amplified ras
genes. A 20-fold increase in the level of a nontransforming
Ras protein is sufficient to allow the
transformation of some cells. The effect has not
been fully quantitated, but it suggests the general
conclusion that oncogenesis depends on overactivity
of Ras protein, and is caused either by
increasing the amount of protein or (probably more
efficiently) by a variety of mutations that increase
the activity of the protein.

Transfection by DNA can be used to transform
only certain cell types. Although transforming
oncogenes have been isolated from both rodent and
human cells, most targets for transformation by
transfection with oncogenes have been rodent
fibroblasts in culture. (In fact, the difference in the
source of the oncogene [human] and the recipient
cell [rodent] is an important factor in allowing
the donor gene to be distinguished unequivocally
from recipient DNA.) Limitations of the assay
explain why relatively few oncogenes have
been detected by transfection. This system has
been most effective with ras genes, where there
is extensive correlation between mutations that
activate c-ras genes in transfection and the
occurrence of tumors.



Insertion 



A variety of genomic changes can activate proto- 
oncogenes, sometimes involving a change in the 
target gene itself, sometimes activating it without 
changing the protein product. In cases of insertion 
and translocation, there is evidence that the 
genomic change is the causative event; in cases of 
amplification there is a correlation with tumori- 
genesis, but no direct proof for a causative role. 

Many tumor cell lines have visible regions of
chromosomal amplification, as shown by homo- 
geneously staining regions (see Figure 56.27) or 
double minute chromosomes (see Figure 56.28). In 
some cases, the amplified region contains a known 
oncogene or a gene related to one. In other cases, 
where amplification is not visible, the use of 
batteries of probes representing oncogenes shows 
that a particular oncogene is amplified. Examples 
of oncogenes that are amplified in various tumors 
include c-myc, c-abl, c-myb, c-erbB, and c-K-ras. 

Established cell lines are prone to amplify genes 
(it is one of several karyotypic changes to which 
they are susceptible). All the same, the presence of
known oncogenes in the amplified regions, and the 
consistent amplification of particular oncogenes in 
many independent tumors of the same type, again 
strengthens the correlation between increased
expression and tumor growth. Of course, it is 
possible that the gene amplification gives an 
advantage to growth of the established tumor; it is 
not necessarily an event involved in its initiation. 

Some proto-oncogenes are activated by events 
that change their expression, but which leave their 
coding sequence unaltered. The best characterized 
is c-myc, whose expression is elevated by several 
mechanisms. One common mechanism is the 
insertion of a nondefective retrovirus in the vicinity
of the gene. 

The ability of a retrovirus to transform without 
expressing a v-onc sequence was first noted during 
analysis of the bursal lymphomas caused by the 
transformation of B lymphocytes with avian virus. 
Similar events occur in the induction of T cell 
lymphomas by murine leukemia virus. In each case, 
the transforming potential of the retrovirus seems to 
lie with its LTR rather than with a coding sequence. 

In many independent tumors, the virus has 
integrated into the cellular genome within or close 
to the c-myc gene. The gene consists of three 
exons; the first represents a long nontranslated
leader, and the second two code for the c-Myc
protein. Figure 59.8 summarizes the types of insertion
at this locus. The retrovirus may be inserted
at a variety of locations relative to the c-myc gene.

The simplest insertions to explain are those that
occur within the first intron. The LTR provides a
promoter, and transcription reads through the two
coding exons. Transcription of c-myc under viral
control differs in two ways from its usual expression:
the level of expression is increased (because
the LTR provides an efficient promoter); and the
transcript lacks its usual nontranslated leader
(which may usually limit expression).

Activation of c-myc in the other two classes of 
insertions reflects different mechanisms. The 
retroviral genome may be inserted within or 
upstream of the first intron, but in reverse orienta- 
tion, so that its promoter points in the wrong 
direction. Probably the LTR provides an enhancer 
that acts on an upstream sequence that fortuitously 
resembles a promoter. The retroviral genome also 
may be inserted downstream of the c-myc gene, in 
which case transcription initiates at the usual c-myc 
promoter(s), but is increased by the enhancer in the
retroviral LTR. 

In all of these cases, the coding sequence of c-myc
is unchanged, so oncogenicity is attributed to the loss 
of normal control, and increased expression, of the 
gene. 

Other oncogenes that are activated in tumors by
the insertion of a retroviral genome include c-erbB,
c-myb, c-mos, c-H-ras, and c-raf. Up to 10 other cellular
genes (not previously identified as oncogenes
by their presence in transforming viruses) are
implicated as potential oncogenes by this criterion.
The best characterized among this latter class are
wnt1 and int2. The wnt1 gene codes for a protein
involved in early embryogenesis that is related to
the wingless gene of Drosophila; int2 codes for a
growth factor related to FGF.

Translocation to a new chromosomal location is
another of the mechanisms by which oncogenes are
activated. Certain chromosomal translocations are
consistently associated with activation of oncogenes
that lie near the breakpoints. This situation
was originally discovered via a connection
between the loci coding immunoglobulins and the
occurrence of certain tumors. Specific chromosomal
translocations are often associated with
plasmacytomas in the mouse and with Burkitt 
lymphomas in man. These tumors arise from 
undifferentiated B lymphocytes. The common 
feature in both species is that an oncogene on one 
chromosome is brought into the proximity of an Ig 
locus on another chromosome. Similar events 
occur in T lymphocytes to bring oncogenes into the 
proximity of a TcR locus. 

The basic cause of the translocation event is a 
malfunction of the system responsible for recombining
V and C genes or switching between IgH
C genes. Instead of acting on two sites within the Ig 
or TcR locus, the system recombines the immune 
locus with an unrelated region on a different 
chromosome. This results in a reciprocal 
translocation, which is illustrated in Figure 59.9. 

We do not know the basis for the involvement of 
the nonimmune partner, but in both man and 
mouse it is often the c-myc locus. In man, the
translocations in B cell tumors usually involve chromosome
8, which carries c-myc, and chromosome
14, which carries the IgH locus; ~10% involve chromosome
8 and either chromosome 2 (κ locus)
or chromosome 22 (λ locus). The translocations
in T cell tumors often involve
chromosome 8, and either chromosome 14 (which
has the TcRα locus at the other end from the Ig locus)
or chromosome 7 (which carries the TcRβ locus).
Analogous translocations occur in the mouse.

Translocations at the IgH locus in B cells fall into 
two classes. One type is similar to those observed 
at other Ig loci and at TcR loci, involving the 
consensus sequences used for V-D-J somatic 
recombination of active Ig genes. In the other type, 
the translocation occurs at a switching site, so these 
cases may be associated with function of the 
system that switches expression from one CH gene
to another. 

When c-myc is translocated to the Ig locus, its 
level of expression is usually increased. The 
increase varies considerably among individual 
tumors, generally being in the range from 2-fold to 10-fold.
Why does translocation activate the c-myc gene? 
The translocation event does not involve fixed sites, 
but occurs at a variety of locations within a general
region on each recombining chromosome. The
event has two consequences: c-myc is brought into 
a new region, one in which an Ig or TcR gene was 
actively expressed; and the structure of the c-myc 
gene may itself be changed (but usually not involv- 
ing the coding regions). It seems likely that several 
different mechanisms can activate the c-myc gene 
in its new location (just as retroviral insertions
activate c-myc in a variety of ways). 

The correlation between the tumorigenic 
phenotype and the activation of c-myc by either
insertion or translocation suggests that continued 
high expression of the c-Myc protein is oncogenic. 
Expression of c-myc must be switched off to enable 
immature lymphocytes to differentiate into mature 
B and T cells; failure to turn off c-myc maintains the 
cells in the undifferentiated (dividing) state. The 
oncogenic potential of c-myc has been demon- 
strated directly by the creation of transgenic mice 
carrying a normal c-myc gene linked to an 
enhancer. Transgenic mice carrying a c-myc gene 
linked to a B lymphocyte-specific enhancer (the IgH 
enhancer) develop lymphomas. The tumors repre- 
sent both immature and mature B lymphocytes, 
suggesting that over-expression of c-myc is tumori- 
genic throughout the B cell lineage. Transgenic 
mice carrying a c-myc gene under the control of the 
LTR from a mouse mammary tumor virus, 
however, develop a variety of cancers, including 
mammary carcinomas. This suggests that 
increased or continued expression of c-myc trans- 
forms the type of cell in which it occurs into a 
corresponding tumor. Specificity of the tumor type 
may therefore depend on the mechanism used 
to activate c-myc; it is not an intrinsic property
of the gene. 

c-myc exhibits three means of oncogene 
activation: retroviral insertion, chromosomal 
translocation, and gene amplification. The com- 
mon thread among them is increased expression of 
the oncogene rather than a qualitative change in its 
coding function, although in at least some cases 
the transcript has lost the usual (and possibly 
regulatory) nontranslated leader. c-myc provides
the paradigm for oncogenes that may be effectively
activated by increased (or possibly altered) 
expression. 

Every translocation generates reciprocal prod- 
ucts; sometimes a known oncogene is activated in 
one of the products, but in other cases it is not 
evident which of the reciprocal products has 
responsibility for oncogenicity. Also, it is not 
axiomatic that the gene(s) at the breakpoint have
responsibility; for example, the translocation could
provide an enhancer that activates another gene 
nearby. 

A variety of translocations found in B and T cells 
have identified new oncogenes. In some cases, the 
translocation generates a hybrid gene, in which an 
active transcription unit is broken by the transloca- 
tion. This has the result that the exons of one gene 
may be connected to another. In such cases, there 
are two potential causes of oncogenicity: the proto-oncogene
part of the protein may be activated in
some way that is independent of the other part, for
example, because it is over-expressed under its
new management (a situation directly comparable
with the example of c-myc); or the other partner in
the hybrid gene may have some positive effect that 
generates a gain-of-function in the part of the 
protein coded by the proto-oncogene. 

One of the best characterized cases in which a
translocation creates a hybrid oncogene is provided
by the Philadelphia (Ph1) chromosome present
in patients with chronic myelogenous leukemia
(CML). This reciprocal translocation is too small to
be visible in the karyotype, but links a 5,000 kb
region from the end of chromosome 9 carrying c-abl
to the bcr region of chromosome 22. The bcr (breakpoint
cluster region) was originally named to
describe a region of ~5.8 kb within which breakpoints
occur on chromosome 22. Different cases of
CML have breakpoints at different locations within
this region.

The consequences of this translocation are sum- 
marized in Figure 59.10. The bcr region lies within 
a large (>90 kb) gene, which is now known as the 
bcr gene. The breakpoints in CML usually occur 
within one of two introns in the middle of the gene. 
The same gene is also involved in translocations 
that generate another disease, ALL (acute lym- 
phoblastic leukemia); in this case, the breakpoint in 
the bcr gene occurs in the first intron. 

The c-abl gene is expressed by alternative 
splicing that uses either of the first two exons. The 
breakpoints in both CML and ALL occur in the 
intron that precedes the first common exon. 
Although the exact breakpoints on both chromosomes
9 and 22 vary in individual cases, the
common outcome is the production of a transcript
coding for a Bcr-Abl fusion protein, in which
N-terminal sequences derived from bcr are linked
to c-abl sequences. In ALL, the 185,000 dalton
fusion protein has ~45,000 daltons of the Bcr protein
linked to c-Abl; in CML the fusion protein of 210,000
daltons has ~70,000 daltons of the Bcr protein. In
each case, the fusion protein contains ~140,000
daltons of the usual ~145,000 dalton c-Abl protein, that is,
it has lost just a few N-terminal amino acids of the
c-Abl sequence.
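
The masses quoted for the two fusion proteins can be checked with simple arithmetic. The Python sketch below is only an illustration: the names BCR_PORTION, ABL_RETAINED and fusion_mass are hypothetical, and the numbers are the approximate values given in the text, not measured data.

```python
# Approximate masses in daltons, taken from the figures quoted in the text.
ABL_FULL = 145_000       # usual c-Abl protein
ABL_RETAINED = 140_000   # c-Abl portion present in both fusion proteins

# Bcr contribution depends on where the breakpoint falls in the bcr gene.
BCR_PORTION = {"ALL": 45_000, "CML": 70_000}


def fusion_mass(disease: str) -> int:
    """Return the approximate Bcr-Abl fusion mass for 'ALL' or 'CML'."""
    return BCR_PORTION[disease] + ABL_RETAINED


if __name__ == "__main__":
    for disease in ("ALL", "CML"):
        print(disease, fusion_mass(disease))
    # Portion of c-Abl lost from its N-terminus in the fusion.
    print("c-Abl lost:", ABL_FULL - ABL_RETAINED)
```

The sums reproduce the 185,000 and 210,000 dalton proteins described for ALL and CML, and show that only about 5,000 daltons of c-Abl are missing in either case.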

Why is the fusion protein oncogenic? It relies
on an interaction between the N-terminal region
provided by bcr and the c-Abl protein. The bcr gene
has a variety of sequence motifs related to proteins
involved in signalling pathways, but the pertinent
one is a serine/threonine kinase activity that is
coded within the first exon. This autophosphorylates
residues in this part of the protein, and the
phosphorylation confers the ability to interact with
a region of the c-Abl protein called the SH2 domain
(we discuss the nature of SH2 domains later). This
raises the possibility that the Bcr part of the protein
interacts with the c-Abl sequences, perhaps
changing their conformation and activating a
latent oncogenic potential.

Changes at the N-terminus are involved in the
activation of oncogenic activity of v-abl, a transforming
forming version of the gene carried in a retrovirus. 
The c-abl gene codes for a tyrosine kinase activity; 
this activity is essential for transforming potential 
in oncogenic variants. Deletion (or replacement) of 
the N-terminal region activates the kinase activity 
and transforming capacity. So the N-terminus 
provides a domain that usually regulates kinase 
activity; its loss may cause inappropriate activation. 



Loss of tumor suppressors causes tumor formation

The common theme in the role of oncogenes in
tumorigenesis is that increased or altered activity of
the gene product is oncogenic. Whether the oncogene
is introduced by a virus or results from a
mutation in the genome, it is dominant over its
allelic proto-oncogene(s). A mutation that activates
a single allele is tumorigenic. Tumorigenesis then
results from gain of a function.

Certain tumors are caused by a different mechanism:
loss of both alleles at a locus is tumorigenic.
Propensity to form such tumors may be inherited
through the germline; it also occurs as the result of
somatic change in the individual. Tumorigenesis
then results from loss of function. Such cases identify
tumor suppressors: genes whose products are
needed for normal cell function, and whose loss of
function causes tumors. The two best characterized
genes of this class code for the proteins RB and p53.

Retinoblastoma is a human childhood disease,
involving a tumor on the retina. It occurs both as a
heritable trait and sporadically (by somatic mutation).
It is often associated with deletions of band
q14 of human chromosome 13. The RB gene has
been localized to this region by molecular cloning.
Figure 59.11 illustrates the situation.



Retinoblastoma arises when both copies of the RB 
gene are inactivated. In the inherited form of the 
disease, one parental chromosome carries an alter- 
ation in this region, usually a deletion. A somatic 
event in retinal cells that causes loss of the other 
copy of the RB gene causes a tumor. In the sporadic 
form of the disease, the parental chromosomes are 
normal, and both RB alleles are lost by (individual) 
somatic events. 

Almost half the cases of retinoblastoma show 
deletions at the RB locus. In other cases, transcripts 
of the locus are either absent or altered in length. 
The protein product is absent from retinoblastoma 
cells. The cause of the tumor is therefore loss of 
protein function, usually resulting from mutations 
that prevent gene expression (as opposed to point 
mutations that affect function of the protein prod- 
uct). Loss of RB is involved also in other forms of 
cancer, including osteosarcomas and small cell 
lung cancers. 

What is the molecular function of RB? It interacts 
with a variety of other proteins, including several 
tumor antigens: SV40 T antigen, adenovirus E1A,
human papilloma virus E7. One possibility is that 
part of the oncogenicity of these proteins is due to 
AMPLIFICATION OF CELLULAR ONCOGENES IN CANCER CELLS
K. ALITALO 

FROM THE DEPARTMENT OF VIROLOGY, UNIVERSITY OF HELSINKI, HELSINKI, FINLAND 

ABSTRACT 

Regulatory or structural alterations of cellular oncogenes have been implicated in the causation of various 
cancers. Oncogene alteration by point mutations can result in a protein product with strongly enhanced oncogenic
potential. Aberrant expression of cellular oncogenes may be due to tumour-specific chromosomal
translocations that dysregulate the normal functions of a proto-oncogene. Amplification of cellular oncogenes
can also augment their expression by increasing the amount of DNA template available for the production
of mRNA. It appears that amplification of certain oncogenes is a common correlate of the progression
of some tumours and also occurs as a rare sporadic event affecting various oncogenes in different types
of cancer. Amplified copies of oncogenes may or may not be associated with chromosomal abnormalities 
signifying DNA amplification: double minute chromosomes and homogeneously staining chromosomal 
regions. Amplified oncogenes, whether sporadic or tumour type-specific, are expressed at elevated levels 
in some cases in cells where their diploid forms are normally silent. Increased dosage of an amplified 
oncogene may contribute to the multistep progression of at least some cancers. 

KEY WORDS: CELLULAR ONCOGENES, GENE AMPLIFICATION, MULTISTEP CARCINOGENESIS, CLONAL SELECTION,
KARYOTYPIC ABNORMALITIES, DOUBLE MINUTE CHROMOSOMES, HOMOGENEOUSLY STAINING CHROMOSOMAL 
REGIONS 



DNA SEQUENCE AMPLIFICATION AND 
CYTOGENETIC ABNORMALITIES
IN TUMOURS 

Since its discovery in drug-resistant eukaryotic 
cells, somatic amplification of specific genes 
has been implicated in an increasing variety 
of adaptive responses of cells to environmental 
stresses (70, 79). Cytogenetic abnormalities, 
double minute chromosomes (dmin:s) associated
with DNA amplification had already
been discovered in tumour cells before the
discovery of dmin:s and homogeneously staining
chromosomal regions (HSR:s) in cells selected
for drug-resistance (12, 24, 49, 50, 56).
In metaphase spreads, dmin:s appear as small,
spherical, usually paired chromosome-like
structures that lack a centromere (Fig. 2). HSR:s
stain with intermediate intensity throughout 
their length rather than with the normal pat- 
tern of alternating dark and light bands in 
both trypsin-Giemsa (Fig. 3A) and quinacrine 
dihydrochloride-stained preparations. Both 
kinds of abnormalities are occasionally found 
in metaphases of freshly isolated cancer cells 
but not of normal cells (8). 

Dmin:s and HSR:s are apparently rare in
tumour cells in vivo, although exact data are
difficult to obtain since the abnormalities are
easily missed in routine cytogenetic analysis
(8, 42). Dmin:s and HSR:s have been described
in most types of in vitro-cultured malignant
tumour cells, with a notable frequency
in neuroblastoma cell lines (11). Initial growth
in cell culture apparently selects for tumour
cells that contain either dmin:s or HSR:s.
Moreover, in culture dmin:s are frequently
lost, concomitant with the appearance of
clonal populations of cells that have developed
an HSR, suggesting that the two cytogenetic
abnormalities are alternative forms of
gene amplification and that HSR:s may confer
a selective advantage on cells over dmin:s (11,
70). It has been assumed that HSR:s can break
down to form dmin:s and that dmin:s can integrate
into chromosomes to generate HSR:s
(11, 23). Amplified genes may also occupy abnormally
banding regions, ABR:s (51, 59). Experimental
work on drug-resistant cells has
shown that in the absence of a selection pressure
(drug), dmin:s and the amplified genes
that they contain are lost, whereas amplified
DNA in the form of HSR:s is retained in the
cells (71). This is explained by the fact that
dmin:s are segregated unevenly in mitosis and
frequently get lost from the nucleus due to
TABLE 1
Currently known oncogenes.

Retrovirus (example)
Oncogene
Cellular location

RSV
Y73V
GR-FeSV
Ab-MuLV
FuSV
UR2V

src
fgr
abl
fps/fes
fes/fps
ros

Plasma membrane
Plasma membrane
Plasma membrane
Plasma membrane
Cytoplasm (plasma membrane?)
Cytoplasm (cytoskeleton?)

Tyrosine-specific protein kinases
(fgr contains sequences homologous to actin)

Class 1a (Cytoplasmic tyrosine protein kinases)

AEV
SM-FeSV
MH-2V
3611-MSV
Mo-MSV

erb-B
fms
mil/raf
raf/mil

Plasma membrane and cytoplasmic membranes
Plasma membrane
membranes
Cytoplasm
Cytoplasm

a growth factor receptor?
?
?

proteins)

Secreted

PDGF-like growth factor

Class 2 (Growth factors)

Ha-ras
Ki-ras

Plasma membrane
Plasma membrane

GTP-binding proteins

Class 3 (Cytoplasmic GTPases)
their lack of centromeres (49). HSR chromosomes
carry centromeres and are therefore
divided equally at mitosis. If dmin:s and
HSR:s contain amplified genes that encode
growth-stimulating protein products, it would
follow that the more stable chromosomal
form, the HSR, confers a greater selective
growth advantage for cells. Although dmin:s
and HSR:s have been described predominantly
in tumour cells selected for resistance to cytotoxic
drugs, it is also clear that dmin:s and
HSR:s may be present in cancer cells before
any form of therapy (8). It was in this setting
that we and others first chose to explore the
possible amplification of cellular oncogenes.
(By definition, cellular oncogenes are normally
innocent genetic loci which can be activated
to transforming genes in various ways). 1



DMIN:S AND HSR:S CONTAIN AMPLIFIED 
ONCOGENES 

Table 2 summarizes the somatic amplifica- 
tions of cellular oncogenes so far reported in 

1 It is not the purpose of this review to deal with all forms 
of DNA damage that have been found to activate cellular
oncogenes. For the purpose of integrating the review
into a coherent picture, however, the reader is
given a list of known cellular oncogenes in Table 1 and 
the schematic Figure 1 illustrating the various ways in 
which the oncogenic potential of different proto-onco- 
genes can be activated. 
Fig. 1. Activation of cellular oncogenes. The haploid complement of a proto-oncogene is schematically depicted in A,
composed of three exons (black boxes) in a segment of DNA. The different activated forms are shown
in B-G. The abbreviation c-onc stands for cellular oncogene, v-onc viral oncogene, DNA sequences with associated
strong promoter/enhancer functions are striated, and an actively transcribed gene is marked with radiations. B. Acutely
transforming retroviruses have the capacity to transduce cellular oncogenes (c-onc) into their genome, modify them
and reinsert their activated oncogenes (v-onc) into the genome of host animal cells as a part of their proviral forms.
The activity of the v-onc gene is greatly enhanced due to the associated promoter of the proviral long terminal repeat.
Both increased dosage of the oncogene and structural mutations within its sequence may contribute to tumorigenesis.
C. Slow transforming retroviruses without oncogenes replicate and reinsert their proviral copies into the host cell DNA
during a latency period from infection to tumorigenesis. Tumor initiation through hyperplastic growth may begin
when the provirus integrates sufficiently close to a proto-oncogene to activate it through promoter or enhancer functions.
It should be noted, however, that mutations have also been found in the oncogenes thus activated and that mutational
damage to other oncogenes has been described in the resulting tumors. D. In some mouse plasmacytomas a
retrovirus-like DNA element (directing the synthesis of the so-called intracisternal A-type particles, IAPs) has been
found in association with a transcriptionally activated oncogene c-mos. The IAP insertion also disrupts the 5' part of
c-mos (64). E. In humans, as well as in animals, chromosome translocations may place proto-oncogenes into transcriptionally
active regions of chromatin, where they may be activated. The details of this mechanism have not been worked
out, but it is believed to occur for c-myc and c-abl genes in Burkitt lymphomas and Philadelphia-chromosome positive
leukemias, respectively (35, 40). F. Increased amounts of oncogene-specific RNA and protein can also result from an
excess of DNA template for transcription acquired through oncogene amplification. This review focuses
primarily on this mechanism. G. Mutationally activated oncogenes have been found in malignant
tumours. Oncogene loci activated by somatic structural mutations are revealed by transfection experiments
where they are introduced into the genetic background of nontumourigenic cultured immortalized cells. Such
transforming loci have been cloned and many of them belong to the c-ras oncogene family. It should be pointed out
that both structural mutations and either increased expression or activation of a complementing oncogene may be
required to achieve a fully tumorigenic phenotype (44).



tumour cells. Although the sampling of tu- 
mours is at present small, the finding of 
known cellular oncogenes among amplified 
DNA represented by dmin:s and HSR:s of 
cancer cells is provocative. Amplification has 
been found to affect at least five out of twenty 
known cellular oncogenes and the degree of 
gene amplification varies from five to many 
hundred-fold over the single haploid copies 
found in normal cells (see also ref. 18). The 
first amplification reported involved the c-myc
oncogene (see Table 1) in a promyelocytic leukaemia
cell line HL-60 (20, 25). The degree of
c-myc amplification is between 8- and 32-fold
both in the HL-60 cell line and in primary leukaemic
cells from the patient (20, 25). Original
clonal lines of HL-60 were later found to
contain some dmin:s in culture but their number
was insufficient to establish any clear correlation
with amplified c-myc. Such a correlation,
however, was discovered for c-myc amplification
in a neuroendocrine cell line from
colon carcinoma, COLO 320 (5). In these
cells, the approximately 30-fold amplified
c-myc copies were mapped either to HSR:s of
a marker chromosome (5, Fig. 3B) or to dmin:s
(Fig. 2), depending on the particular subline studied.
Since dmin:s were already present in the
primary culture of cells from the colon carcinoma,
it is likely that c-myc had also been
amplified during growth of the tumour in vivo.
Similarly, amplified copies of the c-Ki-ras oncogene
were mapped to dmin:s and HSR:s of
mouse adrenocortical tumor Y1 (74). The examination
of changes in other oncogenes
in cancer cells has since revealed amplifications
that do not show up as dmin:s or HSR:s.

Fig. 2. Double minute chromosomes (arrowheads) in
COLO 320 DM colon carcinoma cells. A. The dmin:s are
visible as dots among normal chromosomes in
this fluorescent, benzimidazole-stained preparation. B-D.
Purification of dmin:s by differential centrifugations. B.
Starting material. C. Chromosome fraction. D. Purified
dmin:s (Donna George and the author, unpublished
data and ref. 52).



Fig. 3. A. The homogeneously staining region (HSR) in
the G-banded HSR-marker chromosome [...]
has evolved from an X-chromosome
(52 and unpublished data of C. C. Lin and the author). B.
The about 30-fold amplified copies of the c-myc oncogene [...].
The latter is shown here by in situ hybridization
(5, 52).



Thus, for example, the c-myb oncogene is
amplified in a characteristic marker chromosome
of a colon carcinoma without evidence
of HSR:s (ref. 6, Fig. 4) and in other tumours,
the amplified c-abl and c-myc oncogene loci
map to abnormally banding regions (ABR:s) in
translocated or resident chromosomal segments,
respectively (59, 76).

TRANSLOCATIONS AND 
REARRANGEMENTS MAY ACCOMPANY 
ONCOGENE AMPLIFICATION 

The evolution and progression of the karyo- 
type of tumour cells is complicated (see ref 
68). Concomitant with amplification, DNA se- 
quences acquire an increased mobility in the 
genome with extrachromosomal intermediates 
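TABLE 2

Sporadic and tumour-specific amplification of cellular oncogenes.*

Sporadic:

HL60 (acute promyelocytic leukaemia, M3)
  Oncogene: c-myc; fold amplified: 20x; chromosomal location of amplified gene: 8q(ABR); expression elevated: Yes
  Remarks: Amplification present in primary leukaemic cells
  References: 20, 25, 59

COLO320 (colon carcinoma)
  Oncogene: c-myc; fold amplified: 30x; chromosomal location of amplified gene: dmin, HSR; expression elevated: Yes
  Remarks: Part of the amplified c-myc sequences rearranged
  References: 4, 5, 52

Y1 (adrenocortical tumour)
  Oncogene: c-Ki-ras; fold amplified: 50x; chromosomal location of amplified gene: dmin, HSR; expression elevated: Yes
  Remarks: Levels of p21 c-Ki-ras protein elevated
  References: 74

COLO201/205 (colon carcinoma)
  Oncogene: c-myb; fold amplified: 10x; chromosomal location of amplified gene: mar; expression elevated: Yes
  Remarks: Patient treated with 5-fluorouracil prior to culturing of the tumour cells
  References: 4, 6, 88

K562 (chronic myelogenous leukaemia, CML)
  Oncogene: c-abl; fold amplified: 10x; chromosomal location of amplified gene: mar(ABR); expression elevated: Yes
  Remarks: Cλ coamplified in the marker that may be derived from chromosome 22; c-abl protein-associated tyrosine kinase activated
  References: 21, 22, 41, 54, 76

A431 (epidermoid carcinoma)
  Oncogene: c-erbB; fold amplified: 15-20x; chromosomal location of amplified gene: n.d.; expression elevated: Yes
  Remarks: Amplification linked to chromosome 7 translocation and sequence rearrangements. Amount of protein product, the EGF receptor, elevated (see 36)
  References: 82

ML1-3 (acute myeloid leukaemia, M2)
  Oncogene: c-myb; fold amplified: 5-10x; chromosomal location of amplified gene: n.d.; expression elevated: Yes
  Remarks: Abnormalities of chromosome 6q22-24, where c-myb is normally located
  References: 34, 61, 91

SK BR-3 (breast carcinoma)
  Oncogene: c-myc; fold amplified: 10x; chromosomal location of amplified gene: n.d.; expression elevated: Yes
  References: 43

SEWA (polyoma virus-induced mouse tumour)
  Oncogene: c-myc; fold amplified: 30x; chromosomal location of amplified gene: n.d.; expression elevated: Yes
  Remarks: Cells have dmin:s depending on culture conditions; c-myc amplification correlates with growth as a tumour
  References: Manfred Schwab, personal communication

Lu-65 (lung giant cell carcinoma)
  Oncogene: c-myc; fold amplified: 8x; chromosomal location of amplified gene: n.d.; expression elevated: n.d.
  Oncogene: c-Ki-ras; fold amplified: 10x; chromosomal location of amplified gene: n.d.; expression elevated: n.d.
  Remarks: At least some copies of c-Ki-ras mutated
  References: 80

Primary leukemic cells from an acute myeloid leukemia (M2) patient
  Oncogene: c-myc; fold amplified: 33x; chromosomal location of amplified gene: n.d.; expression elevated: n.d.
  References: Unpublished data of the author and A. de la Chapelle

Tumour-specific:

Small-cell lung cancer
  Oncogene: c-myc, L-myc, N-myc; fold amplified: up to 80x; chromosomal location of amplified gene: n.d.; expression elevated: Yes
  Remarks: Most amplifications in the variant phenotype of SCLC
  References: 53, 69

Neuroblastomas
  Oncogene: N-myc; fold amplified: up to 250x; chromosomal location of amplified gene: dmin, HSR; expression elevated: Yes
  Remarks: N-myc also amplified in primary tumours of advanced grade
  References: 14, 48, 72, 73, 75

Glioblastomas
  Oncogene: c-erbB; fold amplified: -
  Remarks: Rearrangements also found
  References: Josef Schlessinger, personal communication

n.d. = not determined; mar = marker chromosome; M2, M3 refer to the French-American-British classification of
acute myeloid leukemias.

* At least one case of oncogene amplification in normal germ-line cells has been found (18).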



visualized as dmin:s, transpositions and translocations to other chromosomal segments, etc. (see 70 for references). There may not be preferred chromosomal sites for the apparent reintegration of dmin:s as HSR:s (75). In at least one case, however, an oncogene may have been caught amplifying in situ in its resident chromosomal site (59). The finding of moderately amplified oncogenes also in chromosomal sites lacking HSR:s suggests that (onco)gene amplification may be more common than the structural alterations shown by chromosome banding and microscopy (6, 88).

In at least three reported cases amplification has been accompanied by a DNA rearrangement of the oncogene (5, 20, 82). In the colon carcinoma COLO 320 both damaged and normal versions of the c-myc gene are amplified (5). Although individual cell clones have not yet been examined, our unpublished experiments suggest that the same dmin-containing cells harbor and express both normal and rearranged forms of c-myc. The normal version of the amplified gene, however, predominates in COLO 320 cells containing HSR:s; the rearranged version is present only in what appears to be a single copy (Fig. 5). In the chronic myeloid leukaemia (erythroleukaemia) cell line K562 an amplified DNA segment consists of portions of both the c-abl oncogene and the immunoglobulin λ locus (76). In both cases abnormal transcripts are produced from the rearranged amplified oncogenes (Fig. 6 and ref. 22). In K562 cells, the abnormal c-abl oncogene product has also been activated as a tyrosine protein kinase (41). It is not known, however, whether structural alterations of the genes preceded amplification or whether they were acquired during the process of gene amplification. It seems likely that a chromosomal translocation of c-abl to the [...] preceded DNA amplification in the K562 cells, since all amplified copies were already rearranged (21), with the change reminiscent of the Philadelphia translocation [t(9;22)] found in most chronic myeloid leukaemia tumours (35, 66-68). Although they have not been sequenced, other reported cases of amplified oncogenes are apparently normal on the basis of mapping with restriction endonucleases (see Table 2). Therefore we cannot at present view mutation as a necessary companion of oncogene amplification.

Fig. 4. Localization of amplified c-myb in COLO 201/205 cells by in situ hybridization. Shown is a characteristic, large marker chromosome (mar1) with G-banding (A) and associated c-myb autoradiographic grains (B). Note the absence of HSR:s. Mar1 has probably evolved from chromosome number 6, the resident site of the c-myb oncogene in normal cells (34, 88, 91). (Robert Winqvist and the author, unpublished data).

Fig. 5. Amplification and rearrangement of c-myc in COLO 320 cells. 10 µg of cellular DNA was digested with Sst I, electrophoresed, blotted and probed with a v-myc Pst I fragment (ref. 2, left panel). Fragments of 2.7 kbp and 1.4 kbp are seen in both normal and amplified c-myc DNA. The 3.3 kbp fragment is derived from a DNA segment of unknown origin translocated to the 5' region of c-myc with a concomitant deletion of its first exon (unpublished data of Manfred Schwab and the author). HSF, human skin fibroblasts; DM, COLO 320 DM cells; HSR, COLO 320 HSR cells. Different amounts of DNA from COLO 320 DM cells as indicated were mixed with calf thymus DNA to give 24 µg of total DNA, cleaved with Sst I, electrophoresed, blotted and probed with a fragment of 3' human c-myc sequences. The intensities of the 2.7 kbp c-myc fragment in different samples were compared to assess its copy number, estimated to be about 30 (5).
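The dilution comparison described in the legend to Fig. 5 can be illustrated with a short calculation. The following is a minimal sketch, assuming that band intensity is proportional to the amount of target DNA loaded and that the reference lane holds single-copy normal DNA; the function names and all densitometry readings are hypothetical and are not data from the study.

```python
def signal_per_microgram(amounts_ug, intensities):
    """Least-squares slope through the origin: band intensity per µg of DNA."""
    numerator = sum(a * i for a, i in zip(amounts_ug, intensities))
    denominator = sum(a * a for a in amounts_ug)
    return numerator / denominator


def relative_copy_number(test_amounts_ug, test_intensities,
                         ref_amount_ug, ref_intensity):
    """Copy number of the test DNA relative to a single-copy reference lane."""
    test_slope = signal_per_microgram(test_amounts_ug, test_intensities)
    ref_slope = ref_intensity / ref_amount_ug
    return test_slope / ref_slope


# Hypothetical readings (arbitrary units) for a dilution series of tumour DNA
# and for one lane of normal DNA; these numbers are invented for illustration.
dilution_ug = [0.2, 0.4, 0.8]
dilution_signal = [60.0, 120.0, 240.0]
estimate = relative_copy_number(dilution_ug, dilution_signal,
                                ref_amount_ug=24.0, ref_intensity=240.0)
print(round(estimate, 1))
```

Fitting a slope through all dilution points, rather than matching a single lane by eye, is a design choice of this sketch and not something the legend specifies.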



THE MECHANISMS OF GENE 
AMPLIFICATION 

The mechanisms of gene amplification and 
the structure of the amplified DNA have been 
worked out mainly in experimental settings 
involving selection for drug-resistance in cell 
culture (70). Although the mechanisms are 
still incompletely known and may vary in dif- 
ferent cases, some general features have 
emerged. 

A spontaneous degree of illegitimate DNA replication seems to exist in normal cells so that various segments of DNA are replicated more than once during a single cell cycle (37). In unselective conditions this DNA is probably lost, e.g. through formation of micronuclei, because the newly synthesized extra copies of DNA are not covalently linked to chromosomal DNA of mitotic cells (65, 71). If, however, there is a selection pressure to retain an increased gene dosage, progressive multiplication of gene copy number results. The incidence of cells bearing amplified genes under conditions of cytotoxic selection can vary by two orders of magnitude and is greatly increased by the presence of mitogenic substances (hormones or tumour promoters) during selection (10, 84, 85) or certain carcinogenic or cytotoxic agents before selection (15, 55, 79, 80, 81, 85). An interesting hypothesis suggested by Varshavsky (84, 85) supposes that the origins of DNA replication "fire" (initiate replication) illegitimately several times during a single cell cycle and that this kind of "replicon misfiring" may be increased by substances such as tumour promoters and mitogenic hormones (10, 84, 85). Mariani and Schimke (55) point out that most of the cytotoxic agents that increase the incidence of gene amplification are inhibitors of DNA synthesis. Aberrant replication is known to take place after transient inhibition of DNA synthesis and this response can lead to gene amplification (46, 47, 55, 90). Mitogenic hormones probably increase disproportionate DNA replication, but they also enhance the colony forming efficiency of drug-resistant cells in selective conditions (10).

Fig. 6. Comparison of the electrophoretic mobilities of c-myc mRNA:s from COLO 320 DM and HSR cells. The size of the normal c-myc mRNA is 2.3 kb. The rearranged c-myc locus in DM cells (see Fig. 5) seems to be predominantly expressed, giving rise to a shortened RNA.

According to the studies of Axel and his collaborators (65), the multiple cycles of unscheduled DNA replication at a single locus during a single cell cycle result in a structure schematically outlined in Fig. 1F. The hydrogen-bonded amplified copies of DNA depicted in Fig. 1F must resolve into a tandem linear array before the next mitosis. This may well occur by homologous recombination between any one of several repeated sequences within the amplified domain (45, 65). Part of the recombinations would lead to extrachromosomal circles possessing an origin for replication (16, 62); these could be the precursors of dmin:s. The unequal recombinations mean that the resolved linear structure consists of tandemly repeated but heterogeneous units. According to Axel's model a gradient of amplification is formed so that centrally located sequences are amplified more than sequences distal to the origin of replication (65). This has, in fact, also been found to explain the large, complex DNA domain amplified in neuroblastoma cells in vivo (38, see also below).
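The gradient of amplification described above can be pictured with a toy calculation. The sketch below is an illustration only, assuming that each round of unscheduled replication duplicates everything inside a bubble centred on the origin and that later bubbles are nested inside earlier ones; the function name, the doubling rule and the bubble sizes are hypothetical simplifications, not details taken from ref. 65.

```python
def onion_skin_copy_number(position, origin, bubble_half_widths):
    """Copies of the sequence at `position` after nested re-replication rounds.

    Assumption of this sketch: every round doubles the sequences lying
    within its bubble (origin +/- half-width) and leaves the rest unchanged.
    """
    copies = 1
    for half_width in bubble_half_widths:
        if abs(position - origin) <= half_width:
            copies *= 2
    return copies


# Hypothetical locus of 100 arbitrary units with the origin at position 50
# and three nested rounds of unscheduled replication.
profile = [onion_skin_copy_number(x, 50, [40, 25, 10]) for x in range(0, 101, 10)]
print(profile)
```

In this toy profile the copy number peaks around the origin and falls off stepwise towards both flanks, which is the qualitative shape of the gradient the model predicts.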

The chromosomal site of integration of transfected genes significantly affects the frequency and cytogenetic result of their experimentally induced amplification (83). The amplification frequency in some transformants has been found to be 100-fold that of the others (83). This suggests that there also are preferred chromosomal positions for amplification of host cellular genes and that chromosomal rearrangements may facilitate gene amplification by positioning chromosomal sequences in a favorable array. In respect of the structural properties of the sequences involved in gene amplification, recombinatorially active regions have been implicated in experimental cases. DNA rearrangements involving restriction fragment length polymorphisms and variation in gene copy number have been detected in the human genome between clusters of short repetitive interspersed DNA sequences called Alu family DNA-sequences (17). Such inter-Alu sequences have also been detected in an extrachromosomal DNA form, including covalently closed circles (17, 78). The copy number of inter-Alu sequences apparently varies in an age- and tissue-specific manner (17, 78), but a comprehensive analysis of the phenomenon in human tumours is not yet available. It is also not yet clear whether these kinds of repetitive sequences are involved in generating amplified oncogene sequences in dmin:s or HSR:s in tumours.

Fig. 8. A, indirect immunofluorescence for p21 (c-Ki-ras) in Y1 DM cells. Similar fluorescence of the plasma membrane was obtained for the Y1 HSR cells. Inset (b), control staining with normal rat serum, and inset (c), staining of normal rat kidney cells with the monoclonal antibody against p21.

Fig. 7. Elevated levels of the p21 (c-Ki-ras) protein in Y1 cells (74). The Y1 DM and HSR cells, which harbor a 50-fold amplified c-Ki-ras oncogene (74), and control cells were labeled with [35S]methionine and the p21 (c-Ki-ras) protein was immunoprecipitated, as detailed (74), with normal rat serum (NRS) or rat monoclonal anti-p21 serum. The proteins were electrophoresed in a 15 % polyacrylamide gel in the presence of SDS. In addition to a major p21 band, a labeled band at about 16 kd was present in the immunoprecipitates. The amount of radioactivity in p21 was about 50-fold that in normal rat kidney cells. The Kirsten sarcoma virus-transformed rat kidney cells (obtained from the American Tissue Culture Collection) also yielded unexpectedly low amounts of the v-Ki-ras protein.



CARCINOGEN-INDUCED GENE
AMPLIFICATION AND CLONAL
SELECTION OF CANCER CELLS

Although cell sorting experiments have shown a basal spontaneous rate of gene amplification in eukaryotic cells (37), this can be increased severalfold by metabolic inhibitors or cytotoxic agents (15, 37, 70, 81, 85). In many respects the latter response is reminiscent of the so-called SOS-response elicited in bacteria by noxious stimuli (see 28). In a teleological context, the rapid induction of gene amplification that apparently occurs frequently through extrachromosomal intermediates may provide cells with genetic material for subsequent selective pressures operating in harmful conditions (60). In cancer cells, the mechanism may enhance the emergence of clonal populations of cells with increasingly malignant properties (58, 60). Such genetic instability is clearly enhanced in cancer cells, leading to the rapid evolution of increasingly malignant tumour cell populations (19, 58). A serious question of practical importance is whether drug resistance in treated patients also selects cells that have an enhanced ability to amplify (onco)genes important for growth and progression of the tumour (84, 85). It is also possible that some of the carcinogenic insults caused by mutagens are only expressed as a result of subsequent amplification events induced by tumour promoters (84, 85) or facilitated by hormones in, say, replicating epithelial cells (10). The persistence of dmin:s in some tumours suggests that there is a selection pressure for their retention (8, 9, 11, [...]). Amplified DNA in dmin:s must contain an origin for DNA replication (62) and must be selected for in daughter cell populations, where it is unevenly segregated (71). In the absence of such a selection pressure dmin:s are lost (71). In at least one study the length of an HSR has been found to increase during a selection of malignant cells for enhanced tumourigenicity (30).
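Why unevenly segregating dmin:s need selection to persist can be seen from a simple probability calculation. The sketch below is a toy model, assuming that every dmin is replicated once per cycle and that each replicated copy passes to either daughter cell independently with probability 1/2; this independence assumption and the function names are hypothetical and are not taken from ref. 71.

```python
from math import comb


def daughter_copy_distribution(copies_before_replication):
    """Probability of each dmin count in one daughter cell (binomial model)."""
    total = 2 * copies_before_replication  # every copy replicated once
    return {k: comb(total, k) / 2 ** total for k in range(total + 1)}


def probability_of_loss(copies_before_replication):
    """Chance that a given daughter cell inherits no dmin at all."""
    return daughter_copy_distribution(copies_before_replication)[0]


for n in (1, 2, 5):
    print(n, probability_of_loss(n))
```

Under these assumptions cells carrying few copies produce dmin-free daughters at an appreciable rate, so without a growth advantage for dmin-bearing cells the elements would gradually disappear from the population.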

The amplified c-erbB gene in A431 cells codes 
for epidermal growth factor receptor (27). The 
abundant amounts of receptor protein on the
A431 cell surface may, however, provide the
cells with an abnormal growth response (31).
A naive supposition is that the amplified se- 
quences in dmin:s and possibly in HSR:s of 
tumours contain growth-promoting genes (see 
36 for references). This seems to fit well with 
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Fig. 9. Construction of a v-myc expression vector. A synthetic linker (ATGAGAATTCATCAT) containing a translational 
initiation codon was inserted downstream from the trp promoter in the pSRC ex 16 Rl expression vector described
previously (see ref. 3). Approximately one half of the v-src sequences coding for the aminoterminal portion of the pp60v-src
protein were then deleted and the remaining portion ligated in translational codon frame with the synthetic ATG. A
Hinc II fragment of v-myc from plasmid clone MC 38 (nucleotides 320-685 in the v-myc sequence in ref. 2) was ligated
downstream from the remaining v-src sequences in continuity with its reading frame. The resulting product contained 3
amino acids from the synthetic linker, 252 amino acids encoded by the 756 base pair fragment from Sma I to Pvu II
restriction sites in v-src DNA, 122 amino acids from the v-myc and 6 amino acids (corresponding to nucleotides
2968-2085) from the pBR322 vector (3).
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| recent findings on amplified oncogenes, though 
in many cases the search for an amplified oncogene is still continuing. Even positive findings do not mandate a role for amplified cellular oncogenes, however, because the domain of amplified DNA is inevitably much larger than a single genetic locus (e.g. 38).



ENHANCED EXPRESSION OF AMPLIFIED 
ONCOGENES 

In all cases where they have been studied, the amplified oncogenes have been found abundantly expressed at the RNA level, roughly in proportion to the amount of DNA amplification (see Table 1). Described cases of elevated RNA expression include examples of abnormal (5, 22) and ectopic (6) transcription. In at least four cases this enhancement is not limited to synthesis of RNA (31, 33, 41, 74, 82). The Y1 cells that have amplified c-Ki-ras contain exceptionally large amounts of its protein product situated on the plasma membrane (ref. 74, Fig. 7 and 8). High amounts of the c-myc encoded protein are also found in COLO 320 cells that have amplified the gene (33). The myc oncogenes have recently been shown to encode nuclear proteins (ref. 1, 3, 26, 29, 32, 33, Fig. 9-11). Both the expression of the c-myc mRNA (39) and the subcellular localization of myc proteins are linked to the cell cycle (ref. 89, Fig. 12). It may be that elevated expression of specific c-myc functions is necessary for cell cycle progression and the growth transformation aspect of the phenotype of cancer cells that may contribute to tumour progression (7, 36). Elevated expression of c-myc has been shown to replace in part platelet-derived growth factor in induction of competence for DNA replication [...]. Generally, enhanced expression of an oncogene could be a necessary prerequisite for acquisition of a growth advantage by cells having extra copies of the gene. This effect could also be the principal contribution of amplification to tumourigenesis.

Fig. 10. E. coli 294 was transfected with the hybrid v-src v-myc plasmid outlined in Fig. 9 and ampicillin-resistant bacterial colonies were checked for the production of a 43,000 m.w. bacterial v-myc protein after induction by growth to different optical densities in minimal essential medium (M9, induction +) or complete medium (LB, induction -) (3).

TUMOUR CELL AND STAGE SPECIFICITY 
OF ONCOGENE ACTIVATION AND 
AMPLIFICATION 

Tumour cell specificity of oncogene amplifi- 
cation has been found in three malignancies. 
The c-myc, L-myc or N-myc oncogene is ampli- 
fied in most cases of the variant form of small- 
cell lung cancer cells (53, 69), c-erbB is ampli- 
fied in several glioblastomas (Josef Schlessin- 
ger, personal communication) and the putative 
N-myc oncogene is amplified in about half of 
grade III and IV neuroblastomas (14, 72, 73, 
75). In addition to HSR:s, small-cell lung can- 
cers and neuroblastomas frequently show a 
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Fig. 12. Fluorescent staining for DNA and myc protein in 
myelocytomatosis virus-transformed quail cells. In interphase cells, the myc protein is confined to the nuclei. In the mitotic cell, myc fluorescence is distributed throughout the cell unlike fluorescence for chromatin, which is compacted to chromosomes in the metaphase plate. In fact, there is less myc fluorescence associated with chromatin than with the rest of the cell. DAPI, diamidinophenylindole DNA stain. The anti-myc protein rabbit antiserum was used in a 1/200 dilution (ref. 89).



deletion of a portion of the short arm of chromosome 1 (13) and chromosome 3 (86, 87), respectively, in karyological examination. Two kinds of changes have also been described in different neuroblastoma oncogenes. The first is a mutation in the N-ras gene, an activated oncogene that was discovered because of its relation to other ras genes and transforming activity in transfection experiments (77). The second is amplification of a distant homologue of the c-myc gene called N-myc (72, 73, 75). Although the transforming potential of the N-myc gene has not yet been established, its consistent presence in a core segment of amplified neuroblastoma DNA (38, 57, 72, 73, 75) and its elevated expression in most retinoblastomas (48) suggest its oncogenic nature.

Taya et al. (80) have recently described a human lung giant cell carcinoma grown in nude mice, where both c-Ki-ras and c-myc oncogenes were amplified about 10-fold. In addition, sequencing studies indicated that at least some of the amplified c-Ki-ras copies were also mutationally activated in the 12th codon. These results fit the multistage theory of cancer development and progression (see 58). Apparently co-operating lesions in cellular oncogenes accumulate during tumour growth and selection and increase the malignant potential of the tumour cells (44).

When does oncogene amplification come into play during tumourigenesis? Gene amplification may not be an initiating event in carcinogenesis. Amplification and enhanced expression of c-myc and N-myc may occur during the progression of human carcinoma of the lung and neuroblastoma cells to a more malignant phenotype (14, 53, 73). There may be, however, no mandatory sequence of oncogene amplifications for the genesis of any particular tumour. Amplification of an oncogene could play its part in malignant progression of already initiated cells whenever it happened to occur.
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Abstract 

The human epidermal growth factor (EGF) receptor is known to be homologous to the v-erb B oncogene protein of the avian erythroblastosis virus. Overexpression of the EGF receptor gene in A431 epidermoid carcinoma cells is due to gene amplification. In this study, a variety of squamous cell carcinomas were examined and one, SCC-15, contained high levels of the EGF receptor as determined by immunoprecipitation via an EGF receptor-specific polyclonal antibody. Using a cloned EGF receptor complementary DNA as a probe, the level of EGF receptor RNA was found to be elevated four-fold in SCC-15 relative to normal cultured keratinocytes. When the same probe was used to identify EGF receptor gene fragments on a genomic DNA blot, the SCC-15 cell line was shown to possess an EGF receptor gene copy number amplified four to five times. Gene amplification results in the enhancement in the level of the EGF receptor in several carcinomas and could be responsible for the appearance of the transformed phenotype in these cells.

Introduction 

The epidermal growth factor (EGF) stimulates growth and elicits a wide variety of rapid and delayed responses by binding to high-affinity cell-surface receptors which are 170-kD glycoproteins (1). Recently, EGF receptor peptides have been sequenced and found to be homologous to the avian erythroblastosis virus erb B oncogene product (2), suggesting that the EGF receptor gene is the human c-erb B oncogene. A431 epidermoid carcinoma cells possess a very large number of EGF receptors (3), and the EGF receptor gene is amplified ~30-fold (4-6). This amplification is responsible for the overexpression of the EGF receptor protein in these cells (4-6).

A cell culture system has been developed permitting serial 
cultivation of keratinocytes, whose growth is modulated by 
EGF (7). Such methods have been used to establish cell lines 
from squamous cell carcinomas of the oral epithelium (8). 
Because of the role of EGF in keratinocyte development, we 
quantified EGF receptor protein and RNA in several squamous cell carcinomas. One cell line, SCC-15, was found to contain high amounts of receptor protein and RNA, and a four- to fivefold amplification of the gene.

Methods 

The squamous cell carcinomas established by Rheinwald and Beckett (8) were obtained from, and maintained according to, the American Type Culture Collection (Rockville, MD). 1623 was originally designated as SCC-15; 1628 as SCC-25; and 1629 as SCC-9 (8). Normal human esophageal epithelial cells were grown as reported (9). Maintenance of other cell lines was as described elsewhere (10). Proteins were labeled with [35S]methionine and immunoprecipitated as previously described (10). PolyA+ RNA was isolated by guanidine isothiocyanate solubilization and CsCl centrifugation (11), and oligo(dT)-affinity chromatography. RNA (Northern) blotting was performed as described (11, 12). High molecular weight DNA was isolated (4) and analyzed by DNA (Southern) blotting (4, 10, 13). The EGF receptor complementary DNA (cDNA) clone pE7 was constructed and isolated from an A431 cDNA library (11). DNA fragments were 32P-labeled by nick translation.

Results 

A large number of cell lines were initially screened for EGF receptor levels by determining their ability to be killed by an EGF-pseudomonas exotoxin conjugate, a technique described previously (10). Several squamous cell carcinomas were found to be relatively sensitive to the EGF-toxin conjugate, including SCC-25, SCC-9, and particularly SCC-15, all derived from the human tongue (8). These three cell lines were labeled with [35S]methionine, and their extracts immunoprecipitated with a goat polyclonal antibody to the EGF receptor, affinity-purified as described (10). When compared with A431 cells, which make very large amounts of the EGF receptor, SCC-25 and SCC-9 make moderate amounts and SCC-15 high amounts of the receptor (Fig. 1, lane a vs. e, g, and c). Quantitation of the immunoprecipitation data revealed that SCC-15, SCC-25, and SCC-9 make 41, 15, and 4% of the amount of EGF receptor made by A431 cells, respectively.

Because SCC-15 cells had high levels of receptor, polyA+ RNA was isolated from these cells, electrophoretically fractionated on agarose, and analyzed by RNA (Northern) blotting. A cloned A431 cDNA (pE7) encoding the EGF receptor (11) was 32P-labeled and used as a hybridization probe to visualize EGF receptor RNAs. Fig. 2 A shows that SCC-15 contains both the 10- and 5.6-kilobase species of EGF receptor RNA. The levels are approximately four- to fivefold higher than those found in either KB or A498 kidney carcinoma cells; these cell lines were previously found to possess readily detectable levels of both receptor RNAs (10). Fig. 2 B shows that cultured human epithelial cells contain EGF receptor-specific RNA (HEIA, lane e) whose levels are higher than an early passage human fibroblast D551 (lane f), equivalent to A498 (lane g), but much lower than SCC-15 (lane b).

Figure 1. Autoradiograph of electrophoresed 35S-labeled EGF receptor protein (arrow) immunoprecipitated by either E, affinity-purified goat polyclonal EGF receptor antibody, or N, normal serum. Molecular weight markers are at left.

To determine if an elevated gene copy number was associated with enhanced expression of the EGF receptor gene in SCC-15 cells, genomic DNA was isolated from normal cultured epithelial cells (HEIA) and SCC-15 cells, digested with HindIII, electrophoretically fractionated, and subjected to DNA blotting analysis. An EGF receptor cDNA (pE7) was used as a hybridization probe to identify receptor DNA fragments. Fig. 3 A reveals that the SCC-15 genome contains four- to fivefold amplified EGF receptor gene sequences relative to normal epithelial cells (lane a vs. b). Analysis of β-actin gene fragments on the same filter by hybridization to a chick actin cDNA probe indicated that equal amounts of DNA were loaded per well (data not shown). Digested SCC-15 DNA had to be diluted about fourfold (Fig. 3 B, lane e) to approximate the signal intensity of receptor DNA fragments from SCC-25, SCC-9, and KB cells (lanes g-i). The KB cell EGF receptor gene is known not to be amplified (10).
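The comparison above relies on normalizing the receptor-gene signal to a loading control (here the actin hybridization) before comparing cell lines. A minimal sketch of that arithmetic follows; the function name and every densitometry value are hypothetical illustrations and are not measurements from this study.

```python
def fold_amplification(test_signal, control_signal,
                       test_loading, control_loading):
    """Ratio of target-gene signals after correcting each lane for loading."""
    test_normalized = test_signal / test_loading
    control_normalized = control_signal / control_loading
    return test_normalized / control_normalized


# Invented band intensities (arbitrary units): receptor-gene bands in a
# tumour lane and a normal lane, with equal actin signals in both lanes.
ratio = fold_amplification(test_signal=450.0, control_signal=100.0,
                           test_loading=80.0, control_loading=80.0)
print(ratio)
```

When the loading-control signals are equal, as reported for the actin probe, the correction drops out and the fold amplification is simply the ratio of the two receptor-gene signals.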

Discussion 

It may be significant that A431 carcinoma cells are not unique in their possession of amplified EGF receptor genes. We report here that the EGF receptor gene in squamous cell carcinoma SCC-15 is amplified four- to fivefold relative to normal epithelial cells. The amplification of the EGF receptor gene may cause the initiation or maintenance of the malignant state in some human cells.

Figure 2. RNA blot analysis of polyA+ RNAs using the 32P-labeled EGF receptor cDNA probe pE7. (A) and (B) are autoradiographs from two separate gels. Sizes are in kilobases (left). 5 (lanes a, b, d, and g) or 10 (lanes c, e, and f) µg of RNA were loaded.

Figure 3. DNA blot analysis of HindIII-digested genomic DNAs using the pE7 probe (see Fig. 2). (A) and (B) are autoradiographs from two separate gels. Sizes are in kilobase pairs (right). 10 µg of DNA was loaded except in (B), lanes d-f, which represent serial dilutions of the 1623 DNA shown in lane c.

Previously, we reported that a variety of transformed cell 
lines synthesize relatively high amounts of both EGF receptor 
protein and messenger RNA (10). It is conceivable that a 
moderate or even a small increase in the level of the EGF 
receptor leads to a change in the cellular phenotype, as has 
been demonstrated for the src gene product (14). If this
hypothesis is correct, then even a minor amplification of the 
EGF receptor gene copy number could contribute to the onset 
of tumorigenesis. Hendler and Ozanne (15) have examined 
lung squamous cell carcinomas and found that they contain a 
2.5-5-fold increase in EGF receptor levels. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. Ørntoft, Thomas Thykjaer, Frederic M. Waldman, Hans Wolf,
and Julio E. Celis



Gjaftj and loss of chromosomal material is characteristic phenomenon at both the transcription and translation Jewels. 

tempted to address this question in pairs of noninvasive ™ ® % of ,ndivWual 9 enes ln soIW tumors 

and Invasive human bidder tumors using a combination T revealed a good corfBfation between gene dose and 

of technology that Included comparative genomic hybrid- mRNA or proteIn Ievels ln case of c-erb-B2, cyclfn cH % 

teation, Wgh density oligonucleotide array-based monitor- emst ' ^ N-myc (3-5). However, a high cycfin D1 protein 

ing of transcript levels (5600 genes), and high resolution two-dimensional gel electrophoresis. The results showed that there is a gene dosage effect that in some cases superimposes on other regulatory mechanisms. This ef- [...] scripts. Areas with loss of DNA, on the other hand, showed either reduced or unaltered transcript levels. Because most proteins resolved by two-dimensional gels are unknown it was only possible to compare mRNA and protein alterations in relatively few cases of abundant proteins. With few exceptions we found a good correlation (p < 0.006) between transcript alterations and protein levels. The implications, as well as limitations, [...]

Aneuploidy is a common feature of most human cancers [...]

[...] expression has been observed without simultaneous amplification (4), and a low level of c-myc copy number increase was observed without concomitant c-myc protein overexpression (6).

[...] characteristic of certain stages of disease progression. In the case of non-invasive pTa transitional cell carcinomas (TCCs), this includes loss of chromosome 9 or parts of it, as well as loss of Y in males. In minimally invasive pT1 TCCs, the following alterations have been reported: 2q-, 11p-, 1q+, 11q13+, 17q+, and 20q+ (7-12). It has been suggested that these regions harbor tumor suppressor genes and oncogenes; however, the large chromosomal areas involved often contain many genes, making meaningful predictions of the functional consequences of losses and gains very difficult.

In this investigation we have combined genome-wide tech- [...]

From the Department of Clinical Biochemistry, Molecular Diagnostic Laboratory and Department of Urology, Aarhus University Hospital, Skejby, DK-8200 Aarhus N, Denmark, AROS Applied Biotechnology ApS, Gustav Wieds Vej 10, DK-8000 Aarhus C, Denmark, UCSF Cancer Center and Department of Laboratory Medicine, University of California, San Francisco, CA 94143-0808, and Institute of Medical Biochemistry and Danish Centre for Human Genome Research, Ole Worms Alle 170, Aarhus University, DK-8000 Aarhus C, Denmark.

Received, September 26, 2001, and in revised form, November 7, [...]

1 The abbreviations used are: CGH, comparative genomic hybridization; TCC, transitional cell carcinoma; LOH, loss of heterozygosity; [...]

Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chr.), CGH profiles, gene location and [...] expression [...] chromosome. A, expression of mRNA in [...] counterpart tumor 335. B, expression of mRNA in invasive tumor 827 compared [...]. The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves [...] a ratio value of 1 (no change), and the vertical lines [...] tumor profile. The colored bars represent one gene [...] (the name of the gene can be seen at www.MDL.DK/sdata.html). The bars indicate [...] expression along the chromosome; the colors indicate that at least half of the genes were up-regulated (black), at least half of the genes [...] (orange). If a gene was [...] a change. A 2-fold level was chosen as this corresponded to one standard deviation in a double determination of ~1300 genes. Centromeres and heterochromatic regions were excluded from data analysis.

EXPERIMENTAL PROCEDURES

Material— Bladder tumor biopsies were sampled after informed consent was obtained and after removal of tissue for routine pathology examination. By light microscopy tumors 335 and 532 were staged by an experienced pathologist as pTa (superficial papillary), grade I and II, respectively; tumors 733 and 827 were staged as pT1 (invasive into submucosa), 733 was staged as solid, and 827 was staged as papillary, both grade III.

mRNA Preparation— Tissue biopsies, obtained fresh from surgery, were embedded immediately in a sodium-guanidinium thiocyanate solution and stored at -80 °C. Total RNA was isolated using the RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). Poly(A)+ RNA was isolated by an oligo(dT) selection step (Oligotex mRNA kit; Qiagen).

cRNA Preparation— 1 µg of mRNA was used as starting material. The first and second strand cDNA synthesis was performed using the SuperScript® choice system (Invitrogen) according to the manufacturer's instructions but using an oligo(dT) primer containing a T7 RNA polymerase binding site. Labeled cRNA was prepared using the MEGAscript® in vitro transcription kit (Ambion). Biotin-labeled CTP and UTP (Enzo) were used, together with unlabeled NTPs in the reaction. Following the in vitro transcription reaction, the unincorporated nucleotides were removed using RNeasy columns (Qiagen).

Array Hybridization and Scanning— Array hybridization and scanning was modified from a previous method (13). 10 µg of cRNA was fragmented at 94 °C for 35 min in buffer containing 40 mM Tris acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 M NaCl, 10 mM Tris, pH 7.6, 0.005% Triton), was heated to 85 °C for 5 min, subsequently cooled to 40 °C, and loaded onto the Affymetrix probe array cartridge. The probe array was then incubated for 18 h at 40 °C at constant rotation (60 rpm). The probe array was exposed to 10 washes in 6x SSPE-T at 25 °C followed by 4 washes in 0.5x SSPE-T at 50 °C. The biotinylated cRNA was stained with a streptavidin-phycoerythrin conjugate, 10 µg/ml (Molecular Probes) in 8x SSPE-T for 30 min at 25 °C followed by 16 washes in 6x SSPE-T at 25 °C. The probe arrays were scanned at 560 nm using a confocal laser scanning microscope (made for Affymetrix by Hewlett-Packard). The readings from the quantitative scanning were analyzed by Affymetrix gene expression analysis software.

Microsatellite Analysis— Microsatellite analysis was performed as described previously (14). Microsatellites were selected by use of www.ncbi.nlm.nih.gov/genemap98, and primer sequences were obtained from the genome data base at www.gdb.org. DNA was extracted from tumor and blood and amplified by PCR in a volume of 20 µl for 35 cycles. The amplicons were denatured and electrophoresed for 3 h in an ABI Prism 377. Data were collected in the Gene Scan program for fragment analysis. Loss of heterozygosity was defined as [...] of one allele detected in tumor amplicons compared with blood.

Proteomic Analysis— TCCs were minced into small pieces and homogenized in a small glass homogenizer in 0.5 ml of lysis solution. Samples were stored at -20 °C until use. The procedure for 2D gel electrophoresis has been described in detail elsewhere (15, 16). Gels were stained with silver nitrate and/or Coomassie Brilliant Blue. Proteins were identified by a combination of procedures that included microsequencing, mass spectrometry, two-dimensional gel Western immunoblotting, and comparison with the master two-dimensional gel image of human keratinocyte proteins; see biobase.dk/cgi-bin/celis.

CGH— Hybridization of differentially labeled tumor and normal DNA to normal metaphase chromosomes was performed as described previously (10). Fluorescein-labeled tumor DNA (200 ng), Texas Red-labeled reference DNA (200 ng), and human Cot-1 DNA (20 µg) were denatured at 37 °C for 5 min and applied to denatured normal metaphase slides. Hybridization was at 37 °C for 2 days. After washing, the slides were counterstained with 0.15 µg/ml 4,6-diamidino-2-phenylindole in an anti-fade solution. A second hybridization was performed for all tumor samples using fluorescein-labeled reference DNA and Texas Red-labeled tumor DNA (inverse labeling) to confirm the aberrations detected during the initial hybridization. Each CGH experiment also included a normal control hybridization using fluorescein- and Texas Red-labeled normal DNA. Digital image analysis was used to identify chromosomal regions with abnormal fluorescence ratios, indicating regions of DNA gains and losses. The average green:red fluorescence intensity ratio profiles were calculated using four images of each chromosome (eight chromosomes total) with normalization of the green:red fluorescence intensity ratio for the entire metaphase and background correction. Chromosome identification was performed based on 4,6-diamidino-2-phenylindole banding patterns. Only images showing uniform high intensity fluorescence with minimal background staining were analyzed. All centromeres, p arms of acrocentric chromosomes, and heterochromatic regions were excluded from the analysis.
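The ratio-profile step described above can be sketched in a few lines. This is only an illustration under stated assumptions: the input ratios are taken to be already normalized and background-corrected, the function names are hypothetical, and the gain/loss rule (mean ratio plus or minus one standard deviation lying entirely on one side of 1) is adapted from the figure legends rather than taken from the authors' image analysis software.

```python
from statistics import mean, stdev

def ratio_profile(profiles):
    """Return (mean, sd) at each position across replicate ratio profiles.

    `profiles` holds one list of green:red ratios per image; all lists
    are assumed to sample the same positions along the chromosome.
    """
    return [(mean(values), stdev(values)) for values in zip(*profiles)]

def call_positions(profiles):
    """Flag each position as gain, loss, or normal.

    Assumed rule: a position is called only when the interval
    mean +/- one standard deviation does not include the ratio 1.
    """
    calls = []
    for m, sd in ratio_profile(profiles):
        if m - sd > 1.0:
            calls.append("gain")
        elif m + sd < 1.0:
            calls.append("loss")
        else:
            calls.append("normal")
    return calls

# Hypothetical ratios from four images of one chromosome, three positions each.
example = [
    [1.0, 1.5, 0.6],
    [1.1, 1.6, 0.5],
    [0.9, 1.4, 0.7],
    [1.0, 1.5, 0.6],
]
print(call_positions(example))
```

With real data the positions would be far denser, and the excluded regions (centromeres, p arms of acrocentric chromosomes, heterochromatin) would have to be masked before calling.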

RESULTS 

Comparative Genomic Hybridization— The CGH analysis identified a number of chromosomal gains and losses in the two invasive tumors (stage pT1, TCCs 733 and 827), whereas the two non-invasive papillomas (stage pTa, TCCs 335 and 532) showed only 9p-, 9q22-q33-, and [...] and 7+, 9q-, and Y-, respectively. Both invasive tumors showed changes (1q22-24+, 2q14.1-qter-, 3q12-q13.3-, 6q12-q22-, 9q34+, 11q12-q13+, 17+, and 20q11.2-q12+) that are typical for their disease stage, as well as additional alterations, some of which are shown in Fig. 1. Areas with gains and losses deviated from the normal copy number to some extent, and the average numerical deviation from normal was 0.4-fold in the case of TCC 733 and 0.3-fold for TCC 827. The largest changes, amounting to at least a doubling of chromosomal content, were observed at 1q23 in TCC 733 (Fig. 1A) and 20q12 in TCC 827 (Fig. 1B).

Table I
Correlation between alterations detected by CGH and by expression monitoring

Top, CGH used as independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as independent variable (if expression alteration - what CGH deviation was found).

Tumor 733 vs. 335
CGH alterations     Expression change clusters                          Concordance
13 Gain             10 Up-regulation, 0 Down-regulation, 3 No change    77%
10 Loss             1 Up-regulation, 5 Down-regulation, 4 No change     50%

Tumor 827 vs. 532
CGH alterations     Expression change clusters                          Concordance
10 Gain             8 Up-regulation, 0 Down-regulation, 2 No change     80%
12 Loss             3 Up-regulation, 2 Down-regulation, 7 No change     17%

Tumor 733 vs. 335
Expression change clusters    CGH alterations                 Concordance
16 Up-regulation              11 Gain, 2 Loss, 3 No change    69%
21 Down-regulation            1 Gain, 8 Loss, 12 No change    38%
15 No change                  3 Gain, 3 Loss, 9 No change     60%

Tumor 827 vs. 532
Expression change clusters    CGH alterations                 Concordance
17 Up-regulation              10 Gain, 5 Loss, 2 No change    59%
9 Down-regulation             0 Gain, 3 Loss, 8 No change     33%
21 No change                  1 Gain, 3 Loss, 17 No change    81%

mRNA Expression in Relation to DNA Copy Number— The mRNA levels from the two invasive tumors (TCCs 827 and 733) were compared with the two non-invasive counterparts (TCCs 532 and 335). This was done in two separate experiments in which we compared TCCs 733 to 335 and 827 to 532, respectively, using two different scaling settings for the arrays to rule out scaling as a confounding parameter. Approximately 1,800 genes that yielded a signal on the arrays were searched in the Unigene and Genemap data bases for chromosomal location, and those with a known location (1096) were plotted as bars covering their purported locus. In that way it was possible to construct a graphic presentation of DNA copy number and relative mRNA levels along the individual chromosomes (Fig. 1).

For each mRNA a ratio was calculated between the level in the invasive versus the non-invasive counterpart. Bars, which represent chromosomal location of a gene, were color-coded according to the expression ratio, and only differences larger than 2-fold were regarded as informative (Fig. 1). The density of genes along the chromosomes varied, and areas containing only one gene were excluded from the calculations. The resolution of the CGH method is very low, and some of the outlier data may arise because the boundaries of the chromosomal aberrations are not known at high resolution.
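As a rough illustration of the scoring just described, the sketch below labels single genes by their invasive to non-invasive ratio and then scores a chromosomal area. The function names and signal values are hypothetical. The handling of absent transcripts and the "at least half of the genes" rule follow the figure legends, and the tie-break (up-regulation checked first) is an assumption made here.

```python
def classify_ratio(invasive, noninvasive, cutoff=2.0):
    """Label one gene from its invasive / non-invasive signal ratio."""
    if invasive == 0 or noninvasive == 0:
        # Assumption: a transcript absent in one sample and present in the
        # other is treated as more than a 2-fold change.
        if invasive == noninvasive:
            return "unchanged"
        return "up" if invasive > 0 else "down"
    ratio = invasive / noninvasive
    if ratio > cutoff:
        return "up"
    if ratio < 1.0 / cutoff:
        return "down"
    return "unchanged"

def call_region(gene_calls):
    """Score a chromosomal area from the labels of the genes mapped to it.

    Areas containing only one gene are excluded (returns None). An area is
    scored as changed when at least half of its genes move the same way.
    """
    n = len(gene_calls)
    if n < 2:
        return None
    if 2 * gene_calls.count("up") >= n:
        return "up"
    if 2 * gene_calls.count("down") >= n:
        return "down"
    return "unchanged"

# Hypothetical (invasive, non-invasive) signals for three genes in one area.
signals = [(1500, 0), (900, 300), (120, 100)]
calls = [classify_ratio(inv, non) for inv, non in signals]
print(calls, call_region(calls))
```

A strict "larger than 2-fold" comparison means a ratio of exactly 2.0 or 0.5 is left unscored, which matters for the single-allele loss case discussed later in the text.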

Two sets of calculations were made from the data. For the first set we used CGH alterations as the independent variable and estimated the frequency of expression alterations in these chromosomal areas. In general, areas with a strong gain of chromosomal material contained a cluster of genes having increased mRNA expression. For example, chromosomes 1q21-q25, 2p, and 9q showed a relative gain of more than 100% in DNA copy number that was accompanied by increased mRNA expression levels in the two tumor pairs (Fig. 1). In most cases, chromosomal gains detected by CGH were accompanied by an increased level of transcripts in both TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal losses, on the other hand, were not accompanied by decreased expression in several cases and were often registered as having unaltered RNA levels (Table I, top). The inability to detect RNA expression changes in these cases was not because of fewer genes mapping to the lost regions (data not shown).
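The concordance percentages quoted above follow directly from the counts in Table I (top). The short sketch below recomputes them; the dictionary layout and function names are illustrative choices, and "concordant" is taken to mean gain with up-regulation or loss with down-regulation.

```python
# Counts as read from Table I (top): CGH alteration -> expression change clusters.
TABLE_I_TOP = {
    "733 vs. 335": {
        "gain": {"up": 10, "down": 0, "unchanged": 3},
        "loss": {"up": 1, "down": 5, "unchanged": 4},
    },
    "827 vs. 532": {
        "gain": {"up": 8, "down": 0, "unchanged": 2},
        "loss": {"up": 3, "down": 2, "unchanged": 7},
    },
}

# Expression change regarded as concordant with each CGH alteration.
EXPECTED = {"gain": "up", "loss": "down"}

def concordance(counts, expected):
    """Percentage of altered areas showing the expected expression change."""
    total = sum(counts.values())
    return round(100 * counts[expected] / total)

def concordance_table(table):
    """Concordance for every tumor pair and CGH alteration type."""
    return {
        (pair, alteration): concordance(counts, EXPECTED[alteration])
        for pair, alterations in table.items()
        for alteration, counts in alterations.items()
    }

for key, value in concordance_table(TABLE_I_TOP).items():
    print(key, f"{value}%")
```

This gives 77% and 50% for TCC 733 and 80% and 17% for TCC 827, matching the table, and makes the asymmetry between gains and losses easy to see.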

In the second set of calculations we selected expression alterations above 2-fold as the independent variable and estimated the frequency of CGH alterations in these areas. As above, we found that increased transcript expression correlated with gain of chromosomal material (TCC 733, 69% and TCC 827, 59%), whereas reduced expression was often detected in areas with unaltered CGH ratios (Table I, bottom). Furthermore, as a control we looked at areas with no alteration in expression. No alteration was detected by CGH in most of these areas (TCC 733, 60% and TCC 827, 81%; see Table I, bottom). Because the ability to observe reduced or increased mRNA expression clustering to a certain chromosomal area clearly reflected the extent of copy number changes, we plotted the maximum CGH aberrations in the regions showing CGH changes against the ability to detect a change in mRNA expression as monitored by the oligonucleotide arrays (Fig. 2). For both tumors TCC 733 (p < 0.015) and TCC 827 (p < 0.00003) a highly significant correlation was observed between the level of CGH ratio change (reflecting the DNA copy number) and alterations detected by the array based technology (Fig. 2). Similar data were obtained when areas with altered expression were used as independent variables. These areas correlated best with CGH when the CGH ratio deviated 1.67- to 2.0-fold (Table I, bottom) but mostly did not at lower CGH deviations. These data probably reflect that loss of an allele may only lead to a 50% reduction in expression level, which is at the cut-off point for detection of expression alterations. Gain of chromosomal material can occur to a much larger extent.

Fig. 2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 (▲) and 733 (♦) and their non-invasive counterparts 532 and 335. The expression change was taken from the Expression line to the right in Fig. 1, which depicts the resulting expression change for a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down-regulated to be scored as an expression change. All chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the ratio value of one were included.
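The dosage argument above can be made concrete with a small calculation. The sketch assumes, purely for illustration, that transcript level scales linearly with gene copy number relative to two reference copies; the text does not claim this holds for every gene, and the function names are hypothetical.

```python
def expected_ratio(tumor_copies, reference_copies=2):
    """Expression ratio expected if transcript level tracks copy number.

    Assumption: strictly linear gene dosage, no other regulation.
    """
    return tumor_copies / reference_copies

def detectable(ratio, cutoff=2.0):
    """True when a ratio lies beyond the 2-fold cut-off in either direction."""
    return ratio > cutoff or ratio < 1.0 / cutoff

# Loss of one allele (1 copy) versus progressively larger gains.
for copies in (1, 2, 3, 4, 6):
    ratio = expected_ratio(copies)
    print(copies, ratio, detectable(ratio))
```

Under this assumption a single-allele loss yields a ratio of exactly 0.5, which sits on the cut-off rather than beyond it, whereas gains can keep growing until they clear the threshold. That asymmetry is one way to read the weaker concordance seen for losses.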

Microsatellite-based Detection of Minor Areas of Losses— In TCC 733, several chromosomal areas exhibiting DNA amplification were preceded or followed by areas with a normal CGH but reduced mRNA expression (see Fig. 1, TCC 733 chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 10q22). To determine whether these results were because of undetected loss of chromosomal material in these regions or because of other non-structural mechanisms regulating transcription, we examined two microsatellites positioned at chromosome 1q25-32 and two at chromosome 2p22. Loss of heterozygosity (LOH) was found at both 1q25 and at 2p22, indicating that minor deleted areas were not detected with the resolution of CGH (Fig. 3). Additionally, chromosome 2p in TCC 733 showed a CGH pattern of gain/no change/gain of DNA that correlated with transcript increase/decrease/increase. Thus, for the areas showing increased expression there was a correlation with the DNA copy number alterations (Fig. 1A). As indicated above, the mRNA decrease observed in the middle of the chromosomal gain was because of LOH, implying that one of the mechanisms for mRNA down-regulation may be regions that have undergone smaller losses of chromosomal material. However, this cannot be detected with the resolution of the CGH method.

In both TCC 733 and TCC 827, the telomeric end of chromosome 11p showed a normal ratio in the CGH analysis; however, clusters of five and three genes, respectively, lost their expression. Two microsatellites (D11S1760, D11S922) positioned close to MUC2, IGF2, and cathepsin D indicated LOH as the most likely mechanism behind the loss of expression (data not shown).

A reduced expression of mRNA observed in TCC 733 at chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 and in TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, and 18q12 was also examined for chromosomal losses using microsatellites positioned as close as possible to the gene loci showing reduced mRNA transcripts. Only the microsatellite positioned at 18q12 showed LOH (Fig. 3), suggesting that transcriptional down-regulation of genes in the other regions may be controlled by other mechanisms.

Fig. 3. Microsatellite analysis of loss of heterozygosity. Tumor 733 showing loss of heterozygosity at chromosome 1q25, detected (a) by D1S215 close to Ku class I histocompatibility antigen (gene number 38 in Fig. 1), (b) by D1S2735 close to cathepsin E (gene number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close to general β-spectrin (gene number 11 on Fig. 1) and of (d) tumor 827 showing loss of heterozygosity at chromosome 18q12 by D18S1118 close to mitochondrial 3-oxoacyl-coenzyme A thiolase (gene number 12 in Fig. 1). The upper curves show the electropherogram obtained from normal DNA from leukocytes (N), and the lower curves show the electropherogram from tumor DNA (T). In all cases one allele is partially lost in the tumor amplicon.

Relation between Changes in mRNA and Protein Levels— 2D-PAGE analysis, in combination with Coomassie Brilliant Blue and/or silver staining, was carried out on all four tumors using fresh biopsy material. 40 well resolved abundant known proteins migrating in areas away from the edges of the pH gradient, and having a known chromosomal location, were selected for analysis in the TCC pair 827/532. Proteins were identified by a combination of methods (see "Experimental Procedures"). In general there was a highly significant correlation (p < 0.005) between mRNA and protein alterations (Fig. 4). Only one gene showed disagreement between transcript alteration and protein alteration. Except for a group of cytokeratins encoded by genes on chromosome 17 (Fig. 5), the analyzed proteins did not belong to a particular family. 26 well focused proteins whose genes had a known chromosomal location were detected in TCCs 733 and 335, and of these 19 correlated (p < 0.005) with the mRNA changes detected using the arrays (Fig. 4). For example, PA-FABP was highly expressed in the non-invasive TCC 335 but lost in the invasive counterpart (TCC 733; see Fig. 5). The smaller number of proteins detected in both 733 and 335 was because of the smaller size of the biopsies that were available.

11 chromosomal regions where CGH showed aberrations that corresponded to the changes in transcript levels also showed corresponding changes in the protein level (Table II). These regions included genes that encode proteins that are found to be frequently altered in bladder cancer, namely cytokeratins 17 and 20, annexins II and IV, and the fatty acid-binding proteins PA-FABP and FBP1. Four of these proteins were encoded by genes in chromosome 17q, a frequently amplified chromosomal area in invasive bladder cancers.

Fig. 4. Correlation between protein levels as judged by 2D-PAGE and transcript ratio. For comparison proteins were divided in three groups, unaltered in level or up- or down-regulated (horizontal axis). The mRNA ratio as determined by oligonucleotide arrays was plotted for each gene (vertical axis). [...], mRNAs that were scored as present in both tumors used for the ratio calculation; ▲, mRNAs that were scored as absent in the invasive tumors (along horizontal axis) or as absent in non-invasive reference (top of figure). Two different scalings were used to exclude scaling as a confounder, TCCs 827 and 532 ([...]) were scaled with background suppression, and TCCs 733 and 335 ([...]) were scaled without suppression. Both comparisons showed highly significant (p < 0.005) differences in mRNA ratios between the groups. Proteins shown were as follows: Group A (from left), phosphoglucomutase 1, glutathione transferase class μ number 4, fatty acid-binding protein homologue, cytokeratin 15, and cytokeratin 13; B (from left), fatty acid-binding protein homologue, 28-kDa heat shock protein, cytokeratin 13, and calcyclin; C (from left), α-enolase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-[...], and pre-mRNA splicing factor; D, mesothelial keratin K7 (type II); E (from top), glutathione S-transferase-π and mesothelial keratin K7 (type II); F (from top and left), adenylyl cyclase-associated protein, E-cadherin, keratin 19, calgizzarin, phosphoglycerate mutase, annexin IV, cytoskeletal γ-actin, hnRNP A[...], integral membrane protein calnexin (IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, translationally controlled tumor protein, liver glyceraldehyde-3-phosphate dehydrogenase, keratin 8, aldehyde reductase, and Na,K-ATPase β-1 subunit; G (from top and left), TCP20, calgizzarin, 70-kDa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP synthase, keratin 19, triosephosphate isomerase, hnRNP F, liver glyceraldehyde-3-phosphate dehydrogenase, glutathione S-transferase-π, and keratin 8; H (from left), plasma gelsolin, autoantigen calreticulin, thioredoxin, and NAD+-dependent 15-hydroxyprostaglandin dehydrogenase; I (from top), prolyl 4-hydroxylase β-subunit, cytokeratin 20, cytokeratin 17, prohibitin, and fructose 1,6-biphosphatase; J, annexin II; K, annexin IV; L (from top and left), 90-kDa heat shock protein, prolyl 4-hydroxylase β-subunit, α-enolase, GRP 78, cyclophilin, and cofilin.

Fig. 5. Comparison of protein and transcript levels in invasive and non-invasive TCCs. The upper part of the figure shows a 2D gel (left) and the oligonucleotide array (right) of TCC 532. The red rectangles on the upper gel highlight the areas that are compared below. Identical areas of 2D gels of TCCs 532 and 827 are shown below. Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC 827 (red annotation). The tile on the array containing probes for cytokeratin 15 is enlarged below the array (red arrow) from TCC 532 and is compared with TCC 827. The upper row of squares in each tile corresponds to perfect match probes; the lower row corresponds to mismatch probes containing a mutation (used for correction for unspecific binding). Absence of signal is depicted as black, and the higher the signal the lighter the color. A high transcript level was detected in TCC 532 (6161 units) whereas a much lower level was detected in TCC 827 (absence of signals). For cytokeratin 13, a high transcript level was also present in TCC 532 (15659 units), and a much lower level was present in TCC 827 (623 units). The 2D gels at the bottom of the figure (left) show levels of PA-FABP and adipocyte-FABP in TCCs 335 and 733 (invasive), respectively. Both proteins are down-regulated in the invasive tumor. To the right we show the array tiles for the PA-FABP transcript. A medium transcript level was detected in the case of TCC 335 (1277 units) whereas very low levels were detected in TCC 733 (166 units). IEF, isoelectric focusing.

DISCUSSION 

Most human cancers have abnormal DNA content, having lost some chromosomal parts and gained others. The present study provides some evidence as to the effect of these gains and losses on gene expression in two pairs of non-invasive and invasive TCCs using high throughput expression arrays and proteomics, in combination with CGH. In general, the results showed that there is a clear individual regulation of the mRNA expression of single genes, which in some cases was superimposed by a DNA copy number effect. In most cases, genes located in chromosomal areas with gains exhibited increased mRNA expression, whereas areas showing losses showed either no change or a reduced mRNA expression. The latter might be because losses most often are restricted to loss of one allele, and the cut-off point for detection of expression alterations was a 2-fold change, thus being at the border of detection. In several cases, however, an increase or decrease in DNA copy number was associated with de novo occurrence or complete loss of transcript, respectively. Some of these transcripts could not be detected in the non-invasive tumor but were present at relatively high levels in areas with DNA amplifications in the invasive tumors (e.g. in TCC 733 transcript from cellular ligand of annexin II gene (chromosome 1q21) from absent to 2670 arbitrary units; in TCC 827 transcript from small proline-rich protein 1 gene (chromosome 1q12-q21.1) from absent to 1326 arbitrary units). It may be anticipated from these data that significant clustering of genes with an increased expression to a certain chromosomal area indicates an increased likelihood of gain of chromosomal material in this area.

Table II
Proteins whose expression level correlates with both mRNA and gene dose changes

Protein                Chromosomal location  Tumor TCC  CGH alteration  Transcript alteration     Protein alteration
Annexin II             1q21                  733        Gain            Abs to Pres(a)            Increase
Annexin IV             2p13                  733        Gain            3.9-Fold up               Increase
Cytokeratin 17         17q12-q21             827        Gain            3.8-Fold up               Increase
Cytokeratin 20         17q21.1               827        Gain            5.6-Fold up               Increase
(PA-)FABP              8q21.2                827        Loss            10-Fold down              Decrease
FBP1                   9q22                  827        Gain            2.3-Fold up               Increase
Plasma gelsolin        9q31                  827        Gain            Abs to Pres               Increase
Heat shock protein 28  15q12-q13             827        Loss            2.5-Fold up               Decrease
Prohibitin             17q21                 827/733    Gain            3.[...]-/2.3-Fold up(b)   Increase
Prolyl 4-hydroxylase   17q25                 827/733    Gain            6.7-/1.6-Fold up          Increase
hnRNP B1               7p15                  827        Loss            2.5-Fold down             Decrease

(a) Abs, absent; Pres, present.
(b) In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733.

Considering the many possible regulatory mechanisms acting at the level of transcription, it seems striking that the gene dose effects were so clearly detectable in gained areas. One hypothetical explanation may lie in the loss of controlled methylation in tumor cells (17-19). Thus, it may be possible that in chromosomes with increased DNA copy numbers two or more alleles could be demethylated simultaneously leading to a higher transcription level, whereas in chromosomes with losses the remaining allele could be partly methylated, turning off the process (20, 21). A recent report has documented a ploidy regulation of gene expression in yeast, but in this case all the genes were present in the same ratio (22), a situation that is not analogous to that of cancer cells, which show marked chromosomal aberrations, as well as gene dosage effects.
Several CGH studies of bladder cancer have shown that some chromosomal aberrations are common at certain stages of disease progression, often occurring in more than 1 of 3 tumors. In pTa tumors, these include 9p-, 9q-, 1q+, Y- (2, 6), and in pT1 tumors, 2q-, 11p-, 11q-, 1q+, 5p+, 8q+, 17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here showed similar aberrations such as 9p- and 9q22-q33- and 9q- and Y-, respectively. Likewise, the two minimal invasive pT1 tumors showed aberrations that are commonly seen at that stage, and TCC 827 had a remarkable resemblance to the commonly seen pattern of losses and gains, such as 1q22-24 amplification (seen in both tumors), 11q14-q22 loss, the latter often linked to 17q+ (both tumors), and 1q+ and 9p-, often linked to 20q+ and 11q13+ (both tumors) (7-9). These observations indicate that the pairs of tumors used in this study exhibit chromosomal changes observed in many tumors, and therefore the findings could be of general importance for bladder cancer.

Considering that the mapping resolution of CGH is about 20 megabases, it is only possible to get a crude picture of chromosomal instability using this technique. Occasionally, we observed reduced transcript levels close to or inside regions with increased copy numbers. Analysis of these regions by positioning heterozygous microsatellites as close as possible to the locus showing reduced gene expression revealed loss of heterozygosity in several cases. It seems likely that multiple and different events occur along each chromosomal arm and that the use of cDNA microarrays for analysis of DNA copy number changes will reach a resolution that can resolve these changes, as has recently been proposed (2). The outlier data were not more frequent at the boundaries of the CGH aberrations. At present we do not know the mechanism behind chromosomal aneuploidy and cannot predict whether chromosomal gains will be transcribed to a larger extent than the two native alleles. A mechanism such as genetic imprinting has an impact on the expression level in normal cells and is often reduced in tumors. However, the relation between imprinting and gain of chromosomal material is not known.

We regard it as a strength of this investigation that we were able to compare invasive tumors to benign tumors rather than to normal urothelium, as the tumors studied were biologically very close and probably represent successive steps in the progression of bladder cancer. Despite the limited amount of fresh tissue available it was possible to apply three different state of the art methods. The observed correlation between DNA copy number and mRNA expression is remarkable when one considers that different pieces of the tumor biopsies were used for the different sets of experiments. This indicates that bladder tumors are relatively homogeneous, a notion recently supported by CGH and LOH data that showed a remarkable similarity even between tumors and distant metastasis (10, 23).

In the few cases analyzed, mRNA and protein levels showed a striking correspondence although in some cases we found discrepancies that may be attributed to translational regulation, post-translational processing, protein degradation, or a combination of these. Some transcripts belong to undertranslated mRNA pools, which are associated with few translationally inactive ribosomes; these pools, however, seem to be rare (24). Protein degradation, for example, may be very important in the case of polypeptides with a short half-life (e.g. signaling proteins). A poor correlation between mRNA and protein levels was found in liver cells as determined by arrays and 2D-PAGE (25), and a moderate correlation was recently reported by Ideker et al. (26) in yeast.

Interestingly, our study revealed a much better correlation between gained chromosomal areas and increased mRNA levels than between loss of chromosomal areas and reduced mRNA levels. In general, the level of CGH change determined the ability to detect a change in transcript. One possible explanation could be that by losing one allele the change in mRNA level is not so dramatic as compared with gain of material, which can be rather unlimited and may lead to a severalfold increase in gene copy number resulting in a much higher impact on transcript level. The latter would be much easier to detect on the expression arrays as the cut-off point was placed at a 2-fold level so as not to be biased by noise on the array. Construction of arrays with a better signal to noise ratio may in the future allow detection of lesser than 2-fold alterations in transcript levels, a feature that may facilitate the analysis of the effect of loss of chromosomal areas on transcript levels.

In eleven cases we found a significant correlation between DNA copy number, mRNA expression, and protein level. Four of these proteins were encoded by genes located at a frequently amplified area in chromosome 17q. Whether DNA copy number is one of the mechanisms behind alteration of these eleven proteins is at present unknown and will have to be proved by other methods using a larger number of samples. One factor making such studies complicated is the large extent of protein modification that occurs after translation, requiring immunoidentification and/or mass spectrometry to correctly identify the proteins in the gels.

In conclusion, the results presented in this study exemplify
the large body of knowledge that may be possible to gather in
the future by combining state-of-the-art techniques that follow
the pathway from DNA to protein (26). Here, we used a traditional
chromosomal CGH method, but in the future high-resolution
CGH based on microarrays with many thousand radiation
hybrid-mapped genes will increase the resolution and information
derived from these types of experiments (2). Combined with
expression arrays analyzing transcripts derived from genes with
known locations, and 2D gel analysis to obtain information at
the post-translational level, a clearer and more developed
understanding of the tumor genome will be forthcoming.
ABSTRACT

Genetic changes underlie tumor progression and may lead to cancer-specific
expression of critical genes. Over 1100 publications have described
the use of comparative genomic hybridization (CGH) to analyze
the pattern of copy number alterations in cancer, but very few of the genes
affected are known. Here, we performed high-resolution CGH analysis on
cDNA microarrays in breast cancer and directly compared copy number
and mRNA expression levels of 13,824 genes to quantitate the impact of
genomic changes on gene expression. We identified and mapped the
boundaries of 24 independent amplicons, ranging in size from 0.2 to 12
Mb. Throughout the genome, both high- and low-level copy number
changes had a substantial impact on gene expression, with 44% of the
highly amplified genes showing overexpression and 10.5% of the highly
overexpressed genes being amplified. Statistical analysis with random
permutation tests identified 270 genes whose expression levels across 14
samples were systematically attributable to gene amplification. These
included most previously described amplified genes in breast cancer and
many novel targets for genomic alterations, including the HOXB7 gene,
the presence of which in a novel amplicon at 17q21.3 was validated in
10.2% of primary breast cancers and associated with poor patient prognosis.
In conclusion, CGH on cDNA microarrays revealed hundreds of
novel genes whose overexpression is attributable to gene amplification.
These genes may provide insights to the clonal evolution and progression
of breast cancer and highlight promising therapeutic targets.

INTRODUCTION

Gene expression patterns revealed by cDNA microarrays have
facilitated classification of cancers into biologically distinct categories,
some of which may explain the clinical behavior of the tumors
(1-5). Despite this progress in diagnostic classification, the molecular
mechanisms underlying gene expression patterns in cancer have re-

Fig. 1. Impact of gene copy number on global gene expression levels. A, percentage of
over- and underexpressed genes (Y axis) according to copy number ratios (X axis).
Threshold values used for over- and underexpression were >2.184 (global upper 7% of
the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). B, percentage
of amplified and deleted genes according to expression ratios. Threshold values for
amplification and deletion were >1.5 and <0.7.





Accumulation of genetic defects is thought to underlie the clonal
evolution of cancer. Identification of the genes that mediate the effects
of genetic changes may be important by highlighting transcripts that
are actively involved in tumor progression. Such transcripts and their
encoded proteins would be ideal targets for anticancer therapies, as
demonstrated by the clinical success of new therapies against amplified
oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and
other solid tumors. Besides amplifications of known oncogenes, over



Received 5/29/02; accepted 8/28/02.

The costs of publication of this article were defrayed in part by the payment of page
charges. This article must therefore be hereby marked advertisement in accordance with
18 U.S.C. Section 1734 solely to indicate this fact.

1 Supported in part by the Academy of Finland, Emil Aaltonen Foundation, the Finnish
Cancer Society, the Pirkanmaa Cancer Society, the Pirkanmaa Cultural Foundation, the
Finnish Breast Cancer Group, the Foundation for the Development of Laboratory Medicine,
the Medical Research Fund of the Tampere University Hospital, the Foundation for
Commercial and Technical Sciences, and the Swedish Research Council.

2 Supplementary data for this article are available at Cancer Research Online
(http://cancerres.aacrjournals.org).

3 Contributed equally to this work.

4 To whom requests for reprints should be addressed, at Laboratory of Cancer Genetics,
Institute of Medical Technology, Lenkkeilijankatu 6, FIN-33520 Tampere, Finland.
Phone: 358-32474125; Fax: 358-32474168; E-mail: anne.kallioniemi@uta.fi.



20 recurrent regions of DNA amplification have been mapped in
breast cancer by CGH (9, 10). However, these amplicons are often
large and poorly defined, and their impact on gene expression remains
unknown.

We hypothesized that genome-wide identification of those gene
expression changes that are attributable to underlying gene copy
number alterations would highlight transcripts that are actively involved
in the causation or maintenance of the malignant phenotype.
To identify such transcripts, we applied a combination of cDNA and
CGH microarrays to: (a) determine the global impact that gene copy
number variation plays in breast cancer development and progression;
and (b) identify and characterize those genes whose mRNA expression
is most significantly associated with amplification of the corresponding
genomic template.

5 The abbreviations used are: CGH, comparative genomic hybridization; FISH,
fluorescence in situ hybridization; RT-PCR, reverse transcription-PCR.

MATERIALS AND METHODS

Breast Cancer Cell Lines. Fourteen breast cancer cell lines (BT-20, BT-474,
HCC1428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468,
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the
American Type Culture Collection (Manassas, VA). Cells were grown under
recommended culture conditions. Genomic DNA and mRNA were isolated
using standard protocols.

Copy Number and Expression Analyses by cDNA Microarrays. The
preparation and printing of the 13,824 cDNA clones on glass slides were
performed as described (11-13). Of these clones, 144 represented uncharacterized
expressed sequence tags, and the remainder corresponded to known
genes. CGH experiments on cDNA microarrays were done as described (14,
15). Briefly, 20 μg of genomic DNA from breast cancer cell lines and normal
human WBCs were digested for 14-18 h with AluI and RsaI (Life Technologies,
Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six μg
of digested cell line DNAs were labeled with Cy3-dUTP (Amersham
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and
posthybridization washes (13) were done as described. For the expression
analyses, a standard reference (Universal Human Reference RNA; Stratagene,
La Jolla, CA) was used in all experiments. Forty μg of reference RNA were
labeled with Cy3-dUTP and 3.5 μg of test mRNA with Cy5-dUTP, and the
labeled cDNAs were hybridized on microarrays as described (13, 15). For both
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo
Alto, CA) was used to measure the fluorescence intensities at the target
locations using the DEARRAY software (16). After background subtraction,
average intensities at each clone in the test hybridization were divided by the
average intensity of the corresponding clone in the control hybridization. For
the copy number analysis, the ratios were normalized on the basis of the
distribution of ratios of all targets on the array and for the expression analysis
on the basis of 88 housekeeping genes, which were spotted four times onto the
array. Low quality measurements (i.e., copy number data with mean reference
intensity <100 fluorescent units, and expression data with both test and
reference intensity <100 fluorescent units and/or with spot size <50 units)
were excluded from the analysis and were treated as missing values. The
distributions of fluorescence ratios were used to define cutpoints for increased/
decreased copy number. Genes with CGH ratio >1.43 (representing the upper
5% of the CGH ratios across all experiments) were considered to be amplified,
and genes with ratio <0.73 (representing the lower 5%) were considered to be
deleted.

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate
the influence of copy number alterations on gene expression, we applied the
following statistical approach. CGH and cDNA calibrated intensity ratios were
log-transformed and normalized using median centering of the values in each
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were
median centered. For each gene, the CGH data were represented by a vector
that was labeled 1 for amplification (ratio, >1.43) and 0 for no amplification.
Amplification was correlated with gene expression using the signal-to-noise
statistics (1). We calculated a weight, w_g, for each gene as follows:

\[ w_g = \frac{m_{g1} - m_{g0}}{\sigma_{g1} + \sigma_{g0}} \]

where m_{g1}, \sigma_{g1} and m_{g0}, \sigma_{g0} denote the means and SDs for the expression
levels for amplified and nonamplified cell lines, respectively. To assess the
statistical significance of each weight, we performed 10,000 random permutations
of the label vector. The probability that a gene had a larger or equal
weight by random permutation than the original weight was denoted by α. A
low α (<0.05) indicates a strong association between gene expression and
amplification.

Genomic Localization of cDNA Clones and Amplicon Mapping. Each
cDNA clone on the microarray was assigned to a Unigene cluster using the
Unigene Build 141. A database of genomic sequence alignment information
for mRNA sequences was created from the August 2001 freeze of the University
of California Santa Cruz's GoldenPath database. The chromosome and
bp positions for each cDNA clone were then retrieved by relating these data
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three
adjacent clones in a single cell line. The amplicon start and end positions were
Table 1 Summary of independent amplicons in 14 breast cancer cell lines by
CGH microarray



Location 



lp13 
Iq2! 
Iq22 
3pJ4 

7o3l 
7q32 

8Q21.U~8q21.13 

8q23J-8q24,14 

8q24.22 

?p!3 

13n22-q3I. 

16<J22 

17qll * 

J7ql2-q2U 

17#UZ-q2l.33 

17^^33 

I7q[233^24J 

19ql3 • 

20qll.22 

20ql3J2 

20ql3.12-qJ3.13 

20ql3.i~o.t3.32 



Start (Mb)


End (Mb) 


I3Z.79 


13234- 


17*92 


17745' 




179J7 ' 


7134 


74.66 . 


. 55.€2 


6035 


125.73 




140.01 


140.68 


86.45 


92.46 


98.45 


' 103X5 


129.88 




.15121 


152.16 


38.65 


.3905. 


77.15 


• 81.38 


86.70 


87.62 . 


2930 


3085 


39.79 


. 4230 


52.47 . 


$5.80 . 




69.70 


69.93 


' 7439 


' 40.63 


4U40 


.34i9 


3535 


' 44.00 


" 45.62 


46.45 


49.43 


51.32 


59U2 



Size (Mb) 



.0.2 

33 
03 
2.7 
53 
5.2 
0.7 
6.0 
4.6 
123 
1.0 
0.6 
43 
D3 
1.6 
3.0 
' 33- 
53 
S.i 

as 

.13 

i.6 

.3.0 
. 7.8 



CGH were validated, with 17q12-q21.2, 17q22-q23, 20q13.1,
and 20q13.2 regions being most commonly amplified. Furthermore,
the boundaries of these amplicons were precisely delineated. In addition,
novel amplicons were identified at 9p13 (38.65-39.25 Mb)
and 17q21.3 (52.47-55.80 Mb).

Direct Identification of Putative Amplification Target Genes.
The cDNA/CGH microarray technique enables the direct correlation
of copy number and expression data on a gene-by-gene basis
throughout the genome. We directly annotated high-resolution
CGH plots with gene expression data using color coding. Fig. 2C
shows that most of the amplified genes in the MCF-7 breast cancer
cell line at 1p13, 17q22-q23, and 20q13 were highly overexpressed.
A view of chromosome 7 in the MDA-468 cell line
implicates EGFR as the most highly overexpressed and amplified
gene at 7p11-p12 (Fig. 3A). In BT-474, the two known amplicons
at 17q12 and 17q22-q23 contained numerous highly overexpressed
genes (Fig. 3B). In addition, several genes, including the
homeobox genes HOXB2 and HOXB7, were highly amplified in a
previously undescribed independent amplicon at 17q21.3. HOXB7
was systematically amplified (as validated by FISH, Fig. 3B, inset)
as well as overexpressed (as verified by RT-PCR, data not shown)
in BT-474, UACC812, and ZR-75-30 cells. Furthermore, this novel



extended to include neighboring nonamplified clones (ratio, <1.5). The amplicon
size determination was partially dependent on local clone density.
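
The amplicon definition used here (a CGH ratio >2.0 in at least two
adjacent clones in two or more cell lines, or in at least three adjacent
clones in a single cell line) can be sketched in code. The following is a
minimal illustration only, not the authors' software: the function name,
the input layout (one list of ratios per cell line, all in the same
chromosomal clone order), and the toy ratios are assumptions, and the
extension of the boundaries to flanking nonamplified clones is left out.

```python
def call_amplicons(ratios, threshold=2.0):
    """Return (first, last) clone-index intervals that meet the definition.

    ratios: one list of CGH ratios per cell line, all in the same
    chromosomal clone order (hypothetical input layout).
    """
    n_clones = len(ratios[0])
    amp = [[r > threshold for r in line] for line in ratios]
    seed = [False] * n_clones

    # Rule 1: two adjacent amplified clones shared by two or more cell lines.
    for i in range(n_clones - 1):
        lines_with_pair = sum(1 for a in amp if a[i] and a[i + 1])
        if lines_with_pair >= 2:
            seed[i] = seed[i + 1] = True

    # Rule 2: three adjacent amplified clones in a single cell line.
    for a in amp:
        for i in range(n_clones - 2):
            if a[i] and a[i + 1] and a[i + 2]:
                seed[i] = seed[i + 1] = seed[i + 2] = True

    # Merge runs of consecutive seed clones into intervals.
    intervals, start = [], None
    for i, flag in enumerate(seed):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            intervals.append((start, i - 1))
            start = None
    if start is not None:
        intervals.append((start, n_clones - 1))
    return intervals


# Invented ratios for three cell lines across nine ordered clones.
toy = [
    [1.0, 2.4, 2.6, 1.1, 1.0, 3.0, 3.2, 2.8, 1.2],
    [0.9, 2.2, 2.1, 1.0, 1.1, 1.0, 1.2, 1.1, 1.0],
    [1.0, 1.1, 0.9, 1.0, 2.5, 1.0, 1.0, 1.0, 1.0],
]
print(call_amplicons(toy))
```

In the toy data, the isolated high ratio in the third cell line does not
qualify, because a single clone above the threshold satisfies neither rule.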

FISH. Dual-color interphase FISH to breast cancer cell lines was done as
described (17). Bacterial artificial chromosome clone RP11-361K8 was labeled
with SpectrumOrange (Vysis, Downers Grove, IL), and SpectrumOrange-labeled
probe for EGFR was obtained from Vysis. SpectrumGreen-labeled
chromosome 7 and 17 centromere probes (Vysis) were used as a
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embedded
primary breast cancers (17) was applied in FISH analyses as described
(18). The use of these specimens was approved by the Ethics Committee of the
University of Basel and by the NIH. Specimens containing a 2-fold or higher
increase in the number of test probe signals, as compared with corresponding
centromere signals, in at least 10% of the tumor cells were considered to be
amplified. Survival analysis was performed using the Kaplan-Meier method
and the log-rank test.

RT-PCR. The HOXB7 expression level was determined relative to
GAPDH. Reverse transcription and PCR amplification were performed using
Access RT-PCR System (Promega Corp., Madison, WI) with 10 ng of mRNA
si a template. HOXB7 primeis were 5 , -OA<3CA0AG<K}ACIXX3OACTT-.3' 
and 5'-GCCr^CACOTAGCGAT^OTAG-3^ 

RESULTS




Global Effect of Copy Number on Gene Expression. 13,824
arrayed cDNA clones were applied for analysis of gene expression
and gene copy number (CGH microarrays) in 14 breast cancer cell
lines. The results illustrate a considerable influence of copy number
on gene expression patterns. Up to 44% of the highly amplified
transcripts (CGH ratio, >2.5) were overexpressed (i.e., belonged to
the global upper 7% of expression ratios), compared with only 6% for
genes with normal copy number levels (Fig. 1A). Conversely, 10.5%
of the transcripts with high-level expression (cDNA ratio, >10)
showed increased copy number (Fig. 1B). Low-level copy number
increases and decreases were also associated with similar, although
less dramatic, outcomes on gene expression (Fig. 1).
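
The kind of tabulation behind these percentages can be sketched as
follows. This is an illustrative reconstruction under stated assumptions,
not the authors' code: the function name and the toy data are invented,
missing values are represented as None, and ties at the cut-off are
counted as overexpressed.

```python
def fraction_overexpressed(cgh, expr, cgh_cut=2.5, top_fraction=0.07):
    """Share of highly amplified genes whose expression lies in the global top fraction.

    cgh, expr: per-gene ratios in matching order; None marks a missing value.
    """
    pairs = [(c, e) for c, e in zip(cgh, expr) if c is not None and e is not None]
    ranked = sorted((e for _, e in pairs), reverse=True)
    n_top = max(1, round(len(ranked) * top_fraction))
    expr_cut = ranked[n_top - 1]  # lowest ratio still inside the top fraction
    amplified = [e for c, e in pairs if c > cgh_cut]
    if not amplified:
        return None
    return sum(e >= expr_cut for e in amplified) / len(amplified)


# Toy data (invented): 100 genes, 7 overexpressed, 5 highly amplified.
expr = [5.0] * 7 + [1.0] * 93
cgh = [3.0] * 3 + [1.0] * 47 + [3.0] * 2 + [1.0] * 48
print(fraction_overexpressed(cgh, expr))
```

Swapping the roles of the two inputs (genes above an expression cut-off,
scored for increased copy number) gives the converse percentage.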

Identification of Distinct Breast Cancer Amplicons. Base-pair
locations obtained for 11,994 cDNAs (86.8%) were used to plot copy
number changes as a function of genomic position (Fig. 2, Supplement
Fig. A). The average spacing of clones throughout the genome
was 267 kb. This high-resolution mapping identified 24 independent
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table
1). Several amplification sites detected previously by chromosomal




Fig. 3. Annotation of gene expression data on CGH microarray profiles. A, genes in the
7p11-p12 amplicon in the MDA-468 cell line are highly expressed (red dots) and include
the EGFR oncogene. B, several genes in the 17q12, 17q21.3, and 17q23 amplicons in the
BT-474 breast cancer cell line are highly overexpressed (red) and include the HOXB7
gene. The data labels and color coding are as indicated for Fig. 2C. Insets show
chromosomal CGH profiles for the corresponding chromosomes and validation of the
increased copy number by interphase FISH using EGFR (red) and chromosome 7
centromere probe (green) to MDA-468 (A) and HOXB7-specific probe (red) and
chromosome 17 centromere (green) to BT-474 cells (B).
Overall, our results illustrate how the identification of genes activated by
gene amplification provides a powerful approach to highlight
genes with an important role in cancer as well as to prioritize and
validate putative targets for therapy development.

also had a significant influence on expression levels of genes in the
regions affected, but these effects were more subtle on a gene-by-gene
basis than those of high-level amplifications. However, the impact of
low-level gains on the dysregulation of gene expression patterns in
cancer may be equally important if not more important than that of
high-level amplifications. Aneuploidy and low-level gains and losses
of chromosomal arms represent the most common types of genetic
alterations in breast and other cancers and, therefore, have an influence
on many genes. Our results in breast cancer extend the recent
studies on the impact of aneuploidy on global gene expression patterns
in yeast cells, acute myeloid leukemia, and a prostate cancer
model system (22-24).

The CGH microarray analysis identified 24 independent breast
cancer amplicons. We defined the precise boundaries for many amplicons
detected previously by chromosomal CGH (9, 10, 25, 26) and
also discovered novel amplicons that had not been detected previously
proximity to other larger amplicons. One of these novel amplicons
involved the homeobox gene region at 17q21.3 and led to the overexpression
of the HOXB7 and HOXB2 genes. The homeodomain
transcription factors are known to be key regulators of embryonic



expression in cancer (27, 28). HOXB7 transfection induced cell proliferation
in melanoma, breast, and ovarian cancer cells and increased
tumorigenicity and angiogenesis in breast cancer (29-32). The present
results imply that gene amplification may be a prominent mechanism
for overexpressing HOXB7 in breast cancer and suggest that
HOXB7 contributes to tumor progression and confers an aggressive
disease phenotype in breast cancer. This view is supported by our
finding of amplification of HOXB7 in 10% of 363 primary breast
cancers, as well as an association of amplification with poor prognosis


We carried out a systematic search to identify genes whose
expression levels across all 14 cell lines were attributable to
amplification status. Statistical analysis revealed 270 such genes
(representing ~2% of all genes on the array), including not only
previously described amplified genes, such as HER-2, MYC,
EGFR, ribosomal protein s6 kinase, and AIB3, but also numerous
novel genes such as NRAS-related gene (1p13), syndecan-2 (8q22),
and bone morphogenic protein (20q13.1), whose activation by
amplification may similarly promote breast cancer progression.
Most of the 270 genes have not been implicated previously in
breast cancer development and suggest novel pathogenetic mechanisms.
Although we would not expect all of them to be causally
involved, it is intriguing that 84% of the genes with associated
functional information were implicated in apoptosis, cell proliferation,
signal transduction, transcription, or other cellular processes
that could directly imply a possible role in cancer progression.
Therefore, a detailed characterization of these genes may provide
biological insights to breast cancer progression and might lead to
the development of novel therapeutic strategies.
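
The permutation analysis behind this gene set, as described in Materials
and Methods, can be sketched for a single gene. This is a minimal
illustration under stated assumptions rather than the authors' code: the
function names and the toy expression values are invented, the sample SD
is used because the text does not say which SD was meant, and each label
group must contain at least two cell lines with non-identical values.

```python
import random
import statistics


def signal_to_noise(expr, labels):
    """Weight for one gene: difference of group means over the sum of group SDs."""
    amp = [e for e, lab in zip(expr, labels) if lab == 1]
    non = [e for e, lab in zip(expr, labels) if lab == 0]
    return (statistics.mean(amp) - statistics.mean(non)) / (
        statistics.stdev(amp) + statistics.stdev(non)
    )


def permutation_alpha(expr, labels, n_perm=10000, seed=0):
    """Observed weight and the share of label permutations with a weight at least as large."""
    rng = random.Random(seed)
    observed = signal_to_noise(expr, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if signal_to_noise(expr, shuffled) >= observed:
            hits += 1
    return observed, hits / n_perm


# Invented log ratios for one gene in 14 cell lines; the first four are amplified.
expr = [2.1, 1.8, 2.5, 1.9, 0.1, -0.2, 0.3, 0.0, -0.1, 0.2, -0.3, 0.15, -0.15, 0.05]
labels = [1, 1, 1, 1] + [0] * 10
weight, alpha = permutation_alpha(expr, labels, n_perm=2000)
print(round(weight, 2), alpha)
```

A gene would be retained when alpha falls below 0.05, which is the
criterion stated in the methods.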
In summary, we demonstrate application of cDNA microarrays
to the analysis of both copy number and expression levels of over
12,000 transcripts throughout the breast cancer genome, roughly
once every 267 kb. This analysis provided: (a) evidence of a
prominent global influence of copy number changes on gene
expression levels; (b) a high-resolution map of 24 independent
amplicons in breast cancer; and (c) identification of a set of 270
genes, the overexpression of which was statistically attributable to
gene amplification. Characterization of a novel amplicon at
17q21.3 implicated amplification and overexpression of the
HOXB7 gene in breast cancer, including a clinical association
Genomic DNA copy number alterations are key genetic events in
the development and progression of human cancers. Here we
report a genome-wide microarray comparative genomic hybridization
(array CGH) analysis of DNA copy number variation in
a series of primary human breast tumors. We have profiled DNA
copy number alteration across 6,691 mapped human genes, in 44
predominantly advanced, primary breast tumors and 10 breast
cancer cell lines. While the overall patterns of DNA amplification
and deletion corroborate previous cytogenetic studies, the high-resolution
(gene-by-gene) mapping of amplicon boundaries and
the quantitative analysis of amplicon shape provide significant
improvement in the localization of candidate oncogenes. Parallel
microarray measurements of mRNA levels reveal the remarkable
degree to which variation in gene copy number contributes to
variation in gene expression in tumor cells. Specifically, we find
that 62% of highly amplified genes show moderately or highly
elevated expression, that DNA copy number influences gene expression
across a wide range of DNA copy number alterations
(deletion, low-, mid- and high-level amplification), that on average,
a 2-fold change in DNA copy number is associated with a corresponding
1.5-fold change in mRNA levels, and that overall, at least
12% of all the variation in gene expression among the breast
tumors is directly attributable to underlying variation in gene copy
number. These findings provide evidence that widespread DNA
copy number alteration can lead directly to global deregulation of
gene expression, which may contribute to the development or
progression of cancer.

Conventional cytogenetic techniques, including comparative
genomic hybridization (CGH) (1), have led to the identification
of a number of recurrent regions of DNA copy number
alteration in breast cancer cell lines and tumors (2-4). While
some of these regions contain known or candidate oncogenes
[e.g., FGFR1 (8p11), MYC (8q24), CCND1 (11q13), ERBB2
(17q12), and ZNF217 (20q13)] and tumor suppressor genes
[RB1 (13q14) and TP53 (17p13)], the relevant gene(s) within
other regions (e.g., gain of 1q, 8q22, and 17q22-24, and loss of
8p) remain to be identified. A high-resolution genome-wide
map, delineating the boundaries of DNA copy number alterations
in tumors, should facilitate the localization and identification
of oncogenes and tumor suppressor genes in breast
cancer. In this study, we have created such a map, using
array-based CGH (5-7) to profile DNA copy number alteration
in a series of breast cancer cell lines and primary tumors.

An unresolved question is the extent to which the widespread
DNA copy number changes that we and others have identified
in breast tumors alter expression of genes within involved
regions. Because we had measured mRNA levels in parallel in
the same samples (8), using the same DNA microarrays, we had
an opportunity to explore on a genomic scale the relationship
between DNA copy number changes and gene expression. From
this analysis, we have identified a significant impact of widespread
DNA copy number alteration on the transcriptional
programs of breast tumors.

www.pnas.org/cgi/doi/10.1073/pnas.162471999

Materials and Methods 

Tumors and Cell Lines. Primary breast tumors were predominantly
large (>3 cm), intermediate-grade, infiltrating ductal carcinomas,
with more than 50% being lymph node positive. The
fraction of tumor cells within specimens averaged at least 50%.
Details of individual tumors have been published (8, 9), and
are summarized in Table 1, which is published as supporting
information on the PNAS web site, www.pnas.org. Breast cancer
cell lines were obtained from the American Type Culture
Collection. Genomic DNA was isolated either using Qiagen
genomic DNA columns, or by phenol/chloroform extraction
followed by ethanol precipitation.

DNA Labeling and Microarray Hybridizations. Genomic DNA labeling
and hybridizations were performed essentially as described
(7), with modifications. Two micrograms
of DNA was labeled in a total volume of 50 microliters and the
volumes of all reagents were adjusted accordingly. "Test" DNA
(from tumors and cell lines) was fluorescently labeled (Cy5) and
hybridized to a human cDNA microarray containing 6,691
different mapped human genes (i.e., UniGene clusters). The
"reference" (labeled with Cy3) for each hybridization was normal
female leukocyte DNA from a single donor. The fabrication
of cDNA microarrays and the labeling and hybridization of
mRNA samples have been described (8).

Data Analysis and Map Positions. Hybridized arrays were scanned
on a GenePix scanner (Axon Instruments, Foster City, CA), and
fluorescence ratios (test/reference) calculated using SCANALYZE
software (available at http://rana.lbl.gov). Fluorescence ratios
were normalized for each array by setting the average log
fluorescence ratio for all array elements equal to 0. Measurements
with fluorescence intensities more than 20% above background
were considered reliable. DNA copy number profiles
that deviated significantly from background ratios measured in
normal genomic DNA control hybridizations were interpreted as
evidence of real DNA copy number alteration (see Estimating
Significance of Altered Fluorescence Ratios in the supporting
information). When indicated, DNA copy number profiles are
displayed as a moving average (symmetric 5-nearest neighbors).
Map positions for arrayed human cDNAs were assigned by


Abbreviation: CGH, comparative genomic hybridization.

To whom reprint requests should be addressed at: Department of Pathology, Stanford
CA 94305-5176. E-mail: pollack1@stanford.edu.
**Present address: Zyomyx Inc., Hayward, CA 94545.

PNAS | October 1, 2002 | vol. 99 | no. 20 | 12963-12968



Fig. 1. a, test/reference ratio color scale (<0.5, 1, >2). Each row represents a
different cell line or tumor, and each column represents one of 6,691 different
mapped human genes present on the microarray. Red luminescence reflects
fold-amplification and green luminescence reflects fold-deletion.
identifying the starting position of the best and longest match of
any DNA sequence represented in the corresponding UniGene
cluster (10) against the "Golden Path" genome assembly
(http://genome.ucsc.edu/; Oct 7, 2000 Freeze). For UniGene
clusters represented by multiple arrayed elements, mean fluorescence
ratios (for all elements representing the same UniGene
cluster) are reported. For mRNA measurements, fluorescence
ratios are "mean-centered" (i.e., reported relative to the mean
ratio across the 44 tumor samples). The data set described here
can be accessed in its entirety in the supporting information.
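
The per-array normalization and the moving-average display described in
this section can be sketched as follows. This is a minimal illustration,
not the SCANALYZE implementation: the function names are invented, the
base-2 logarithm is an assumption (the text does not give the base), and
"symmetric 5-nearest neighbors" is read here as a window of five values
centered on each gene and truncated at the ends of the map.

```python
import math


def normalize_log_ratios(ratios):
    """Return log2 ratios shifted so that their average over the array is 0."""
    logs = [math.log2(r) for r in ratios]
    mean_log = sum(logs) / len(logs)
    return [x - mean_log for x in logs]


def moving_average(values, neighbors=5):
    """Symmetric moving average over values ordered by map position."""
    half = neighbors // 2
    smoothed = []
    for i in range(len(values)):
        window = values[max(0, i - half): i + half + 1]
        smoothed.append(sum(window) / len(window))
    return smoothed


# Invented test/reference ratios for a handful of map-ordered genes.
raw = [1.0, 2.0, 4.0, 2.0, 1.0, 0.5, 1.0]
profile = moving_average(normalize_log_ratios(raw))
print([round(x, 2) for x in profile])
```

Smoothing trades resolution for noise reduction, so a single amplified
gene is flattened while a run of adjacent amplified genes remains visible.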

Results 

We performed CGH on 44 predominantly locally advanced,
primary breast tumors and 10 breast cancer cell lines, using
cDNA microarrays containing 6,691 different mapped human
genes (Fig. 1a; also see Materials and Methods for details of
microarray hybridizations). To take full advantage of the improved
spatial resolution of array CGH, we ordered (fluorescence
ratios for) the 6,691 cDNAs according to the "Golden
Path" (http://genome.ucsc.edu/) genome assembly of the draft
human genome sequences (11). In so doing, arrayed cDNAs not
only themselves represent genes of potential interest (e.g.,
candidate oncogenes within amplicons), but also provide precise
genetic landmarks for chromosomal regions of amplification and
deletion. Parallel analysis of DNA from cell lines containing
different numbers of X chromosomes (Fig. 1b), as we did before
(7), demonstrated the sensitivity of our method to detect single-copy
loss (45,XO), and 1.5- (47,XXX), 2- (48,XXXX), or
2.5-fold (49,XXXXX) gains (also see Fig. 5, which is published
as supporting information on the PNAS web site). Fluorescence
ratios were linearly proportional to copy number ratios, which
were slightly underestimated, in agreement with previous observations
(7). Numerous DNA copy number alterations were
evident in both the breast cancer cell lines and primary tumors
(Fig. 1a), detected in the tumors despite the presence of euploid
non-tumor cell types; the magnitudes of the observed changes
were generally lower in the tumor samples. DNA copy number
alterations were found in every cancer cell line and tumor and
on every human chromosome in at least one sample. Recurrent
regions of DNA copy number gain and loss were readily identifiable.
For example, gains within 1q, 8q, 17q, and 20q were
*^P ro P° rtiofl of breast cancer cell lines/rumors 
P0%/69%. 100%/47%, 10Q%/6O* W and 90%/44%, respective- 
ryX as were losses within lp, 3p, fip, and 13q m%/24% 

with published cytogenetic studies (refs. 2-4; a complete listing
of gains/losses is provided in Tables 2 and 3, which are published
as supporting information on the PNAS web site). The total
number of genomic alterations (gains and losses) was found to
be significantly higher in breast tumors that were high grade (P =
0.008), consistent with published CGH data (3), estrogen receptor
negative (P = 0.04), and harboring TP53 mutations (P =
0.0006) (see Table 4, which is published as supporting information
on the PNAS web site).

The improved spatial resolution of our array CGH analysis is
illustrated for chromosome 8, which displayed extensive DNA
copy number alteration in our series. A detailed view of the
variation in the copy number of 241 genes mapping to chromosome
8 revealed multiple regions of recurrent amplification;
each of these potentially harbors a different known or previously
uncharacterized oncogene (Fig. 2a). The complexity of amplicon
structure is most easily appreciated in the breast cancer cell line
SKBR3. Although a conventional CGH analysis of 8q in SKBR3
identified only two distinct regions of amplification (12), we
observed three distinct regions of high-level amplification (labeled
1-3 in Fig. 2b). For each of these regions we can define the
boundaries of the interval recurrently amplified in the tumors we
examined; in each case, known or plausible candidate oncogenes
can be identified (a description of these regions, as well as the
recurrently amplified regions on chromosomes 17 and 20, can be
found in Figs. 6 and 7, which are published as supporting
information on the PNAS web site).

For a subset of breast cancer cell lines and tumors (4 and 37,
respectively), and a subset of arrayed genes (6,095), mRNA
levels were quantitatively measured in parallel by using cDNA
microarrays (8). The parallel assessment of mRNA levels is
useful in the interpretation of DNA copy number changes. For
example, the highly amplified genes that are also highly expressed
are the strongest candidate oncogenes within an amplicon.
Perhaps more significantly, our parallel analysis of DNA
copy number changes and mRNA levels provides us the opportunity
to assess the global impact of widespread DNA copy
number alteration on gene expression in tumor cells.

A strong influence of DNA copy number on gene expression
is evident in an examination of the pseudocolor representations
of DNA copy number and mRNA levels for genes on chromosome
17 (Fig. 3). The overall patterns of gene amplification and
elevated gene expression are quite concordant; i.e., a significant
fraction of highly amplified genes appear to be correspondingly
highly expressed. The concordance between high-level
amplification and increased gene expression is not restricted to
chromosome 17. Genome-wide, of 117 high-level DNA amplifications
(fluorescence ratios >4, and representing 91 different
genes), 62% (representing 54 different genes; see Table 5, which
is published as supporting information on the PNAS web site)
are found associated with at least moderately elevated mRNA
levels (mean-centered fluorescence ratios >2), and 42%
(representing 36 different genes) are found associated with
comparably highly elevated mRNA levels (mean-centered fluorescence
ratios >4).
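
The tally described above amounts to a threshold count, which can be sketched as follows. This is an illustration only: the function name and the toy ratios are hypothetical and are not the study's data or software; only the cutoffs (DNA fluorescence ratio >4, mean-centered mRNA ratio >2 or >4) come from the text.

```python
def expression_concordance(dna_ratios, mrna_ratios, dna_cut=4.0, mrna_cuts=(2.0, 4.0)):
    """For each mRNA cutoff, return the fraction of high-level amplifications
    (DNA ratio > dna_cut) whose mRNA ratio exceeds that cutoff."""
    # Keep the mRNA ratio of every measurement that counts as a high-level amplification.
    amplified = [m for d, m in zip(dna_ratios, mrna_ratios) if d > dna_cut]
    if not amplified:
        return {cut: 0.0 for cut in mrna_cuts}
    return {cut: sum(m > cut for m in amplified) / len(amplified) for cut in mrna_cuts}


# Hypothetical toy data: five measurements, four of which are amplified (>4).
dna = [5.0, 6.2, 4.5, 1.1, 8.0]
mrna = [2.5, 5.0, 1.2, 3.0, 4.4]
result = expression_concordance(dna, mrna)
print(result)  # fraction above each mRNA cutoff, keyed by cutoff
```

With the counts reported in the text, the same calculation corresponds to 62% of the 117 high-level amplifications exceeding the >2 cutoff and 42% exceeding the >4 cutoff.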

To determine the extent to which DNA deletion and lower-level
amplification (in addition to high-level amplification) are
also associated with corresponding alterations in mRNA levels,
we performed three separate analyses on the complete data set
(4 cell lines and 37 tumors, across 6,095 genes). First, we
determined the average mRNA levels for each of five classes
of genes, representing DNA deletion, no change, and low-,
medium-, and high-level amplification (Fig. 4a). For both the
breast cancer cell lines and tumors, average mRNA levels
tracked with DNA copy number across all five classes, in a
statistically significant fashion (P values for pair-wise Student's
. f tests comparing adjacent classes: cell lines, 4 x 10"^, 1 X NT* 9 . 
5 X 10;* 1 X HT* tumors, 1 X 10" 43 , 1 X 5 X 10-", 

1 X 1(H)). A linear regression of the average log(DNA copy
number), for each class, against average log(mRNA level)
demonstrated that on average, a 2-fold change in DNA copy
number was accompanied by 1.4- and 1.5-fold changes in mRNA
level for the breast cancer cell lines and tumors, respectively (Fig.
4a, regression line not shown). Second, we characterized the
distribution of the 6,095 correlations between DNA copy number
and mRNA level, each across the 37 tumor samples (Fig. 4b).
The distribution of correlations forms a normal-shaped curve,
but with the peak markedly shifted in the positive direction from
zero. This shift is statistically significant, as evidenced in a plot
of observed vs. expected correlations (Fig. 4c), and reflects a
pervasive global influence of DNA copy number alterations on
gene expression. Notably, the highest correlations between DNA
copy number and mRNA level (the right tail of the distribution
in Fig. 4b) comprise both amplified and deleted genes (data not
shown). Third, we used a linear regression model to estimate the
fraction of all variation measured in mRNA levels among the 37
tumors that could be attributed to underlying variation in DNA
copy number. From this analysis, we estimate that, overall, about
1% of all of the observed variation in mRNA levels can be
explained directly by variation in copy number of the altered
genes (Fig. 4d). We can reduce the effects of experimental
measurement error on this estimate by using only that fraction
of the data most reliably measured (fluorescence intensity/
background >3); using that data, our estimate of the percent
variation in mRNA levels directly attributed to variation in gene
copy number increases to 12% (Fig. 4d). This still undoubtedly
represents a significant underestimate, as the observed variation
in global gene expression is affected not only by true variation in
the expression programs of the tumor cells themselves, but also
by the variable presence of non-tumor cell types within clinical



Discussion 

This genome-wide, array CGH analysis of DNA copy number
alteration in a series of human breast tumors demonstrates the
usefulness of defining amplicon boundaries at high resolution
(gene-by-gene), and quantitatively measuring amplicon shape, to
assist in locating and identifying candidate oncogenes. By
analyzing mRNA levels in parallel, we have also discovered that
changes in DNA copy number have a large, pervasive, direct
effect on global gene expression patterns in both breast cancer
cell lines and tumors. Although the DNA microarrays used in our
analysis may display a bias toward characterized and/or highly
expressed genes, because we are examining such a large fraction
of the genome (approximately 20% of all human genes), and
because, as detailed above, we are likely underestimating the
contribution of DNA copy number changes to altered gene
expression, we believe our findings are likely to be generalizable
(but would nevertheless still be remarkable if only applicable to
this set of ~6,100 genes).

In budding yeast, aneuploidy has been shown to result in
chromosome-wide gene expression biases (13). Two recent
studies have begun to examine the global relationship between
DNA copy number and gene expression in cancer cells. In
agreement with our findings, Phillips et al. (14) have shown that
with the acquisition of tumorigenicity in an immortalized
prostate epithelial cell line, new chromosomal gains and losses
resulted in a statistically significant respective increase and
decrease in the average expression level of involved genes. In
contrast, Platzer et al. (15) recently reported that in metastatic
colon tumors only ~4% of genes within amplified regions were
found more highly (>2-fold) expressed when compared with
normal colonic epithelium. This report differs substantially from
our finding that 62% of highly amplified genes in breast cancer
exhibit at least 2-fold increased expression. These contrasting
findings may reflect methodological differences between the
studies. For example, the study of Platzer et al. (15) may have
systematically under-measured gene expression changes. In this
regard it is remarkable that only 14 transcripts of many thousand
residing within unamplified chromosomal regions were found to
exhibit at least 4-fold altered expression in metastatic colon
cancer. Additionally, their reliance on lower-resolution
chromosomal CGH may have resulted in poorly delimiting the
boundaries of high-complexity amplicons, effectively overcalling
regions with amplification. Alternatively, the contrasting findings
for amplified genes may represent real biological differences
between breast and metastatic colon tumors; resolution of this
issue will require further studies.

Our finding that widespread DNA copy number alteration has
a large, pervasive and direct effect on global gene expression
patterns in breast cancer has several important implications.
First, this finding supports a high degree of copy number-dependent
gene expression in tumors. Second, it suggests that
most genes are not subject to specific autoregulation or dosage
compensation. Third, this finding cautions that elevated
expression of an amplified gene cannot alone be considered strong
independent evidence of a candidate oncogene's role in
tumorigenesis. In our study, fully 62% of highly amplified genes
demonstrated moderately or highly elevated expression. This
highlights the importance of high-resolution mapping of amplicon
boundaries and shape [to identify the "driving" gene(s)
within amplicons (16)], on a large number of samples, in addition
to functional studies. Fourth, this finding suggests that analyzing
the genomic distribution of expressed genes, even within existing
microarray gene expression data sets, may permit the inference
of DNA copy number aberration, particularly aneuploidy (where
gene expression can be averaged across large chromosomal
regions; see Fig. 3 and supporting information). Fifth, this
finding implies that a substantial portion of the phenotypic
uniqueness (and by extension, the heterogeneity in clinical
behavior) among patients' tumors may be traceable to underlying
variation in DNA copy number. Sixth, this finding supports
a possible role for widespread DNA copy number alteration in
tumorigenesis (17, 18), beyond the amplification of specific
oncogenes and deletion of specific tumor suppressor genes.
Widespread DNA copy number alteration, and the concomitant
widespread imbalance in gene expression, might disrupt critical
stoichiometric relationships in cell metabolism and physiology
(e.g., proteasome, mitotic spindle), possibly promoting further
chromosomal instability and directly contributing to tumor
development or progression. Finally, our findings suggest the
possibility of cancer therapies that exploit specific or global
imbalances in gene expression in cancer.

Abstract

Background: Colorectal cancer is a common cancer all over the world. Aberrations in the cell
cycle checkpoints have been shown to be of prognostic significance in colorectal cancer.

Methods: The expression of cyclin D1, cyclin A, histone H3 and Ki-67 was examined in 60 colorectal
cancer cases for co-regulation and impact on overall survival using immunohistochemistry,
Southern blot and in situ hybridization techniques. Immunoreactivity was evaluated
semi-quantitatively by determining the staining index of the studied proteins.

Results: There was a significant correlation between cyclin D1 gene amplification and protein
overexpression (concordance = 63.6%) and between Ki-67 and the other studied proteins. The
staining index for Ki-67, cyclin A and D1 was higher in large, poorly differentiated tumors. The
staining index of cyclin D1 was significantly higher in cases with deeply invasive tumors and nodal
metastasis. Overexpression of cyclin A and D1 and amplification of cyclin D1 were associated with
reduced overall survival. Multivariate analysis shows that cyclin D1 and A are two independent
prognostic factors in colorectal cancer patients.

Conclusions: Loss of cell cycle checkpoint control is common in colorectal cancer. Cyclin A and
D1 are superior independent indicators of poor prognosis in colorectal cancer patients. Therefore,
they may help in predicting the clinical outcome of those patients on an individual basis and could
be considered important therapeutic targets.



Background 

Colorectal cancer (CRC) is the third most common cancer
in Western countries [1]. In Egypt, CRC has unique
characteristics that differ from those reported in other countries
of Western society. It was estimated that 35.6% of the
Egyptian CRC cases are below 40 years of age and patients
usually present with advanced stage, high grade tumors
that carry more mutations [2]. This uniquely high proportion
of early-onset CRC, the early and continuous exposure
to hazardous environmental agents, the different
mutational spectrum and the prevalent consanguinity in
Egypt justify further studies [3]. It has been shown that most
cancers result from accumulation of genetic alterations
involving certain groups of genes, the majority of which
are cell cycle regulators that either stimulate or inhibit cell
cycle progression [1]. Cell proliferation depends on orderly
progression through the cell cycle, which is governed by a
number of proteins including cyclins and cyclin-dependent
kinases [4,5]. The cyclins belong to a superfamily of genes
whose products complex with various cyclin-dependent
kinases (cdks) to regulate transitions through key checkpoints
of the cell cycle [6]. Abnormalities of several cyclins
have been reported in different tumor types, implicating,
in particular, cyclin A, cyclin E and cyclin D [6,7].

Cyclin D1 is a G1 cyclin that regulates the transition from
G1 to S phase since its peak level and maximum activity
are reached during the G1 phase of the cell cycle, whereas
cyclin A is regarded as a regulator of the transition to mitosis
since it reaches its maximum level during the S and G2
phases [8]. The mechanisms likely to activate the oncogenic
properties of the cyclins include chromosomal translocations,
gene amplification and aberrant protein
overexpression [7,9].

Several studies have shown that histone H3 mRNA expression
can be used to identify the S phase fraction (SPF)
through the in situ hybridization (ISH) technique [10,11].
The level of histone H3 mRNA reaches its peak during the
S phase and then drops rapidly at the G2 phase [12].

In the face of the increasing incidence of CRC and its peculiar
pattern in the Egyptian population, the present study was
conducted to assess the role of Ki-67 (pan-cell cycle
marker), cyclin D1 (G1 phase marker), histone H3 mRNA
(S phase marker) and cyclin A (S to G2 phase marker) in CRC.
The expression level of these markers was correlated to the
clinicopathologic features and the overall survival of
patients.



Table 1: Clinicopathological features of patients in relation to the staining index (SI) of Ki-67, cyclin D1, cyclin A, histone H3

                                    SI (mean ± SD)
Variables            No. of cases   Ki-67          Cyclin D1      Cyclin A       Histone H3

Sex
  Male               36             18.0 ± 6.4     6.7 ± 4.3      12.7 ± 5.?     10.7 ± 5.3
  Female             24             20.1 ± 5.8     8.8 ± 8.4      10.0 ± 6.0     10.7 ± 5.4
Age (years)
  ≥50                41             11.7 ± 6.0*    5.6 ± 5.2      10.0 ± 5.3     6.0 ± 5.0*
  <50                19             23.8 ± 5.6     7.7 ± 6.8      13.6 ± 5.7     22.0 ± 5.2
Tumor size (cm)
  <5.0               33             12.2 ± 6.3*    5.3 ± 3.8*     11.5 ± 6.1*    10.3 ± 4.9*
  ≥5.0               27             30.1 ± 6.2     22.8 ± 7.2     28.6 ± 5.6     24.0 ± 5.6
Histology
  Normal             20             3.5 ± 2.0*     0.6 ± 0.2*     2.3 ± 1.1*     2.2 ± 0.9
  Carcinoma          60             30.3 ± 6.2     24.9 ± 6.3     27.2 ± 5.8     10.7 ± 5.3
  GI                 15             11.7 ± 6.2     6.6 ± 4.0      10.0 ± 5.4     11.4 ± 4.9
  GII                21             11.8 ± 5.6     8.9 ± 3.6      12.3 ± 6.5     7.8 ± 5.4
  GIII               24             30.0 ± 4.3     22.0 ± 8.1     27.0 ± 4.9     11.5 ± 5.4
Lymph node
  Negative           33             19.5 ± 7.0     5.4 ± 5.3*     11.9 ± 6.5     12.3 ± 5.5
  Positive           27             21.3 ± 4.9     20.6 ± 6.9     12.5 ± 5.0     14.2 ± 5.0
Depth of invasion
  m, sm              17             20.7 ± 6.7     3.1 ± 3.1*     11.9 ± 7.2     10.4 ± 5.1
  beyond sm          43             21.9 ± 6.2     12.4 ± 6.5     12.2 ± 5.6     10.7 ± 5.4
Stage
  I                  6              20.6 ± 6.7     5.7 ± 6.9      24.2 ± 6.9     11.1 ± 5.3
  II                 27             20.8 ± 6.9     5.3 ± 4.3      24.6 ± 6.0     10.4 ± 5.7
  III                12             22.0 ± 5.4     7.7 ± 6.0      27.1 ± 5.2     10.4 ± 4.9
  IV                 15             24.7 ± 6.1     11.3 ± 9.6     27.3 ± 5.5     12.3 ± 6.2

* p value < 0.05 (significant)

Methods

Tissue samples 

Paraffin-embedded tumor tissues were obtained from 60
CRC patients (47 colon and 13 rectal carcinomas) who
were diagnosed and treated at the National Cancer Institute,
Cairo, Egypt during the period from January 1997 to
June 2002. Clinicopathological data of the studied cases
are shown in Table 1. None of the patients received any
chemotherapy or irradiation prior to surgery. Histological
diagnosis of all cases was done by 2 independent pathologists
according to the WHO Histological Classification.
Tumors were staged according to the TNM staging system
[13]. The depth of tumor invasion was classified as invasion
of the mucosa including muscularis mucosa (m),
invasion of the submucosa (sm), or invasion beyond the
submucosa [8]. Normal colonic tissues were obtained
from autopsy specimens (n = 20) and were used as a control.
The actual survival of the patients was calculated
from the date of resection to the date of death.

Immunohistochemistry

Four-micron sections of each normal and tumor specimen
were cut onto positively charged slides, air dried overnight,
de-paraffinized in xylene, hydrated through a series of
graded alcohols and washed in distilled water and 0.01 M
PBS (pH 7.4). Slides were then processed for IHC as
described by Handa et al. [8] using the following antibodies:
Ki-67 (MIB-1, Dako), cyclin A (6E6; Novocastra,
Newcastle-Upon-Tyne, UK) and cyclin D1 (DCS-6, Dako).
A case of invasive breast cancer was used as a positive control
for Ki-67 and cyclin A whereas a case of mantle cell
lymphoma was used as a control for cyclin D1. Negative
controls were obtained by replacing the primary antibody
by non-immunized rabbit or mouse serum.

Brown nuclear staining was regarded as a positive result
for all studied markers. The proportion of positively
stained cells and the intensity of staining were scored in
tumor and normal colorectal mucosal sections at medium
power (×200). The degree of positive tumor staining
(percentage of positive tumor cells in the examined section)
was scored from 1-6 and the staining intensity was scored
from 0-6 according to the pattern of staining in the examined
section. Staining index (SI) was calculated by multiplying
the cellularity and staining scores as described by
King et al. [14].
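
The staining index calculation described above can be sketched as a product of the two scores. This is a minimal illustration under stated assumptions: the function name is hypothetical, the score ranges (1-6 for proportion, 0-6 for intensity) are those given in the text, and fractional scores are accepted on the assumption that scores may be averaged over several fields, which the text does not state.

```python
def staining_index(proportion_score, intensity_score):
    """Staining index (SI) = proportion score x intensity score.

    proportion_score: score for the percentage of positive cells (1-6 in the text).
    intensity_score: score for staining intensity (0-6 in the text).
    """
    if not 1 <= proportion_score <= 6:
        raise ValueError("proportion score must lie between 1 and 6")
    if not 0 <= intensity_score <= 6:
        raise ValueError("intensity score must lie between 0 and 6")
    return proportion_score * intensity_score


# Hypothetical example: proportion score 4, intensity score 5.
si = staining_index(4, 5)
print(si)  # 20
```

Under these ranges the index runs from 0 (no staining intensity) to 36.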

In situ hybridization 

All tumor samples and 5 normal controls were assessed
for histone H3 mRNA by ISH using the commercially available
550 base fluorescein-labeled DNA probe (Dako,
Carpinteria, CA) as described by Nagao et al., 1996. This
probe hybridizes to the whole mRNA transcript of the
human histone H3 gene including the 5' and 3' untranslated
regions. Scoring of histone H3 mRNA was performed
as for immunohistochemistry; however, hybridization
signals were detected in the cytoplasm.

Molecular detection of cyclin D1 gene amplification

High molecular weight DNA was extracted from paraffin-embedded
tissues of the tumor and normal colorectal
mucosal samples as previously described [15]. The proportion
of neoplastic and normal cells was determined in
each tumor sample by examining hematoxylin and eosin-stained
slides obtained from the edge of the specimen
used for DNA extraction. Tumor samples were evaluated
for amplification of cyclin D1 if more than 75% of the
examined sections were formed of neoplastic cells.
Accordingly, 50 cases were eligible for the analysis. Ten
micrograms of the extracted DNA was digested with
EcoRI. DNA from selected cases was also digested with
BglII and HindIII. Samples were separated on 0.8% agarose
gels and transferred to Hybond-N membranes (Amersham
Int., Amersham, UK). The membranes were
hybridized with 50% formamide, 5 × SSC, 5 × Denhardt's,
500 µg/ml denatured salmon sperm DNA, 10% dextran
sulphate and 10^6 cpm/ml of 32P-labeled PRAD-1 probe for
24 h. Membranes were washed with 2 × SSC, 0.1% SDS at
room temperature for 30 min followed by 2 × SSC, 0.1%
SDS at 60 °C for 30 min and 0.1 × SSC, 0.1% SDS at 60 °C
for 1 h. Filters were autoradiographed using an intensifying
screen at -70 °C for 24-72 h. After being stripped free
of the PRAD-1 probe, the same blots were hybridized with
32P-labeled β-actin probe to normalize against possible
variations in the loading or transfer of DNA. The
autoradiograms were analyzed using a densitometer. Intensities
of PRAD-1/cyclin D1 were normalized to the β-actin control
bands. The degree of amplification was calculated
from these normalized values. Amplification was considered
present when the signal of the tumor band was >2-fold the
value of the matched normal mucosa [16].
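
The normalization and the >2-fold criterion described above can be written out as a short calculation. This is a sketch only: the function names and the densitometry readings are hypothetical, and the text does not specify how band intensities were quantified beyond normalization to the β-actin control.

```python
def amplification_fold(tumor_prad1, tumor_actin, normal_prad1, normal_actin):
    """Fold amplification of PRAD-1/cyclin D1 in tumor relative to matched normal mucosa.

    Each PRAD-1 band intensity is first divided by the beta-actin band intensity
    from the same lane to correct for loading or transfer differences.
    """
    tumor_normalized = tumor_prad1 / tumor_actin
    normal_normalized = normal_prad1 / normal_actin
    return tumor_normalized / normal_normalized


def is_amplified(fold, threshold=2.0):
    """Apply the criterion from the text: tumor signal more than 2-fold the normal value."""
    return fold > threshold


# Hypothetical densitometer readings (arbitrary units).
fold = amplification_fold(tumor_prad1=900, tumor_actin=300, normal_prad1=250, normal_actin=250)
print(fold, is_amplified(fold))  # 3.0 True
```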

Statistical analysis 

The Mann-Whitney non-parametric test was used to compare
the SIs of pairs of subjects whereas the Kruskal-Wallis
test was used for categorical data. Correlation between indices
was assessed using a simple linear regression test. The
Kaplan-Meier method was used to create survival curves,
which were analyzed by the log-rank test. The impact of
different variables on survival was determined using the
Cox proportional hazards model. P values less than 0.05
were considered significant.
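
For readers unfamiliar with the Kaplan-Meier method named above, the product-limit estimate can be sketched in a few lines. This is a generic textbook illustration, not the software used in the study; the function name and the follow-up data are hypothetical.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times: follow-up time for each patient (e.g. months).
    events: 1 if the patient died at that time, 0 if censored.
    Returns a list of (time, survival probability) at each distinct death time.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all patients sharing the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / n_at_risk
            curve.append((t, survival))
        # Deaths and censored patients both leave the risk set after time t.
        n_at_risk -= removed
    return curve


# Hypothetical follow-up data for five patients (months, death indicator).
curve = kaplan_meier([6, 12, 12, 20, 30], [1, 1, 0, 1, 0])
for t, s in curve:
    print(t, round(s, 3))
```

In this toy example the estimate drops at months 6, 12 and 20 to roughly 0.8, 0.6 and 0.3; censored patients lower the number at risk without producing a step.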

Results 

The results of IHC are illustrated in Figures 1 and 2. In general,
the staining indices (SIs) of all studied markers were
higher in carcinomas than in normal colonic mucosal
samples (p = 0.0001). Normal colorectal mucosa revealed
positive immunostaining for Ki-67 in the lower half of the
crypts only. A heterogeneous staining pattern was
detected in the neoplastic cells of well- and moderately-differentiated
adenocarcinomas whereas a diffuse homogeneous
staining pattern was detected in poorly-differentiated
carcinomas. The SI ranged from 10-40.2
(mean: 24.6 ± 6.5).

Figure 1

Normal colonic mucosa showing positive nuclear immunostaining for: (a) cyclin D1, (b) ISH of histone H3 mRNA, (c) Ki-67 and
(d) cyclin A.

Immunostaining for cyclin D1 was predominantly nuclear
but cytoplasmic staining was detected in some cases.
However, unless nuclear staining was also detected,
cases with cytoplasmic staining were considered negative.
Normal colorectal mucosal samples were almost negative
for cyclin D1 whereas 41 out of the 60 (68.3%) CRC cases
were positive. Marked heterogeneity was observed in well-
and moderately-differentiated adenocarcinomas even
within the same tumor. Poorly-differentiated carcinomas
revealed a diffuse staining pattern with more darkly
stained nuclei. The SI ranged from 0.5-28.6 (mean: 9.3 ±
4.2).



Positive nuclear staining for cyclin A was detected in 80%
(48/60) of CRC cases and in all non-neoplastic control
samples. Positively stained nuclei were confined to the
lower half of the crypts in normal colonic mucosa and
diffusely dispersed in carcinomas. The SI ranged from 3.3-30.2
(mean: 15.1 ± 6.6).

Histone H3 mRNA was intensely expressed in the cytoplasm
of all examined samples, whether neoplastic or non-neoplastic.
The distribution of histone H3 mRNA was
similar to that of cyclin A and Ki-67; however, the proportion
of histone H3 mRNA positive cells was less than that
of Ki-67. The SI ranged from 1.8-24.2 (mean: 12.4 ± 5.3).

The PRAD-1 probe detected 3 EcoRI fragments of 4.0, 2.2
and 2.0 kb and 1 BglII fragment of 15 kb. PRAD-1/cyclin D1
gene amplification was detected in 22/50 (44%) cases
analyzed. The degree of amplification was heterogeneous
with a 2-10 fold increase when compared to normal
mucosal samples (Figure 3). Amplification was confirmed
by other restriction enzymes.

Figure 2

A case of well differentiated adenocarcinoma with positive immunostaining for: (a) cyclin D1, (b) histone H3 mRNA, (c) Ki-67,
and (d) cyclin A. Another case of moderately differentiated adenocarcinoma with positive immunostaining for: (e) cyclin D1, (f)
histone H3 mRNA, (g) Ki-67, and (h) cyclin A. A case of poorly differentiated adenocarcinoma with diffuse staining for: (i) cyclin
D1, (j) ISH of histone H3 mRNA, (k) Ki-67 and (l) cyclin A.

Correlations 

There was a significant correlation between cyclin D1 gene
amplification and protein overexpression. Out of the 22
cases that showed amplification, 14 showed protein
overexpression (concordance = 63.6%).

Linear regression analysis of SIs revealed a significant
correlation between Ki-67 and cyclin D1, cyclin A, histone H3
as well as between the SIs of cyclin A and histone H3 (p =
0.008, 0.0001, and 0.0001 respectively) (Figure 4). There
was a significant relationship between the SI of both Ki-67
and cyclin A and the degree of differentiation of tumors as
well as the size of the tumor (p < 0.001 and p < 0.01
respectively). In addition, SI of Ki-67 and histone H3 were
higher in patients <50 years than in those ≥50 years (p <
0.05) (Table 1).

Figure 3

A: Southern blot analysis of normal mucosa (N) and their seven corresponding cases of colonic adenocarcinomas (T1-T7);
cases No. 1, 2, 4, and 5 are poorly differentiated whereas cases No. 3, 6, and 7 are moderately differentiated. Genomic DNA
was digested with BglII, fractionated by electrophoresis in agarose gel, transferred onto membranes and hybridized with PRAD1
and β-actin. Tumors number 1-6 (lanes 1-6) show different degrees of PRAD1/cyclin D1 amplification; tumor number 7 (lane 7)
was not amplified. B: Southern blot analysis of 3 cases of adenocarcinomas (T) and matched normal colonic mucosa (N).
Genomic DNA was digested with EcoRI, fractionated by electrophoresis in agarose gel, transferred onto membranes and
hybridized with PRAD1 and β-actin probes for loading control. The identification of the 3 tumors is the same as in Fig. 3A with
amplification of PRAD1/cyclin D1 in tumors number 4, 5 (lanes 1, 2) but not 7 (lane 3).



In addition, Table 2 shows a significant relationship
between high cyclin D1 SI and large, poorly-differentiated
tumors, carcinomas with positive lymph node metastasis
and deeply-invasive carcinomas (p < 0.05, p < 0.001, p <
0.05 and p < 0.05 respectively). Cyclin D1 gene
amplification, in contrast, was significantly associated with an
advanced disease stage since amplification was detected in
10/15 (66.7%) of stage IV tumors compared to 12/45
(26.7%) of stage I-III tumors (p = 0.002). Similarly, DNA
amplification was detected in 60.5% (26/43) of the carcinomas
with extensive local invasion (beyond sm) but
only in 23.5% (4/17) of the carcinomas with limited invasion
(m, sm) (p = 0.001). A significant correlation was
also present between cyclin D1 gene amplification and the
presence of lymph node metastasis (p = 0.008) as well as
between the SI of histone H3, the size of the tumor and the
patient's age (p < 0.05, p < 0.001 respectively). The SI was
higher in tumors >5 cm in diameter and in patients <50
years.

Figure 4

Correlation between the staining intensity of (a) Ki-67 vs. cyclin D1, (b) Ki-67 vs. histone H3, (c) Ki-67 vs. cyclin A and (d) cyclin A
vs. histone H3 mRNA expression.

Survival analysis 

The mean follow-up period for all patients was 30 months
(range: 1-66 months). Eighteen of 60 patients had
already died by the time the study was completed. We
defined the cutoff level for overexpression of each cell
cycle marker at the point that showed the maximum difference
of survival rate between the 2 groups separated by
that point. Cox regression analysis revealed that cyclin A
overexpression (our definition: SI ≥ 10.5), cyclin D1
overexpression (our definition: SI ≥ 6.1), poorly differentiated
histology, lymph node metastasis, TNM stage, tumor size
and depth of invasion were all significant prognostic variables
for survival (Table 3). The Kaplan-Meier survival
curves for the subgroups of patients who are subdivided
according to each marker's status are shown in Figure 5.
Patients with tumors that showed Ki-67 overexpression
(our definition: SI ≥ 11.5) and histone H3 overexpression
(our definition: SI ≥ 8.2) tended to have a poor prognosis
but this did not reach a statistically significant level;
however, the overall survival was significantly lower in patients
with cyclin A and cyclin D1 overexpression. Cox multivariate
regression analysis revealed that lymph node metastasis,
cyclin A and cyclin D1 overexpression were
independent negative prognostic factors after adjustment
for the depth of tumor invasion, age and sex of the patient
(Table 4).

Table 2: The relation between cyclin D1 overexpression vs cyclin D1 amplification and clinicopathological prognostic markers.

Variable             No. of cases   Cyclin D1 overexpression   Cyclin D1 amplification

Tumor size (cm)
  <5.0               33             5.3 ± 3.8*                 13/33
  ≥5.0               27             22.8 ± 7.2  p < 0.05       9/27  p < 0.236
Histology
  GI                 15             6.6 ± 4.0                  7/15
  GII                21             8.9 ± 3.6                  8/21
  GIII               24             22.0 ± 8.1  p < 0.001      7/24  p < 0.075
Lymph node
  Negative           33             5.4 ± 5.3*                 6/33 (18.2%)
  Positive           27             20.6 ± 6.9  p < 0.05       16/27 (59.3%)  p < 0.008
Depth of invasion
  m, sm              17             3.1 ± 3.1*                 4/17 (23.5%)
  beyond sm          43             12.4 ± 6.5  p < 0.05       26/43 (60.5%)  p < 0.001
Stage
  early              45             5.5 ± 10.1                 12/45 (26.7%)
  late               15             11.3 ± 9.6  P = 0.175      10/15 (66.7%)  p < 0.002

Table 3: Univariate analysis of the relationship between survival and the tested markers

Predictive variables   Median survival   HR      CI               P

Ki-67
  <11.5                36
  ≥11.5                32                1.826   0.636-5.243      0.26
Cyclin D1
  <6.1                 35
  ≥6.1                 18                7.246   1.007-45.150     0.03*
Histone H3
  <8.2                 35
  ≥8.2                 29                4.639   0.854-25.196     0.07
Cyclin A
  <10.5                35
  ≥10.5                15                7.820   1.017-60.122     0.02*
Histological grade
  Low                  38
  High                 10                7.331   2.696-19.940     0.0001*
Lymph node
  Negative             38
  Positive             15                6.826   1.973-23.621     0.002*
Stage
  I, II, III           38
  IV                   12                6.378   1.842-22.083     0.001*
Tumor size (cm)
  <5.0                 35
  ≥5.0                 13                4.835   1.386-16.868     0.01*
Depth of invasion
  T1, T2               36
  T3, T4               20                7.759   1.024-58.789     0.04*
Age (years)
  <50                  38
  ≥50                  28                2.802   0.988-7.943      0.0526
Sex
  Male                 38
  Female               36                0.696   0.274-1.766      0.4449

* p value < 0.05 (significant)

HR: Hazard Ratio

CI: 95% confidence interval

Figure 5

Kaplan-Meier survival curves for colorectal carcinoma. Overall survival is significantly lower in patients with (a) cyclin A and (b)
cyclin D1 overexpression. Patients with high SI for histone H3 mRNA have poorer prognosis but this was not statistically
significant (c). No significant difference was present between patients with high Ki-67 SI and those with low Ki-67 SI (d).

Discussion 

The proliferative activity of CRC cells has been investigated
in several studies either by immunohistochemical
determination of cell proliferation index using antibodies
to some types of cyclins or by flow cytometric determination
of the SPF of the cell cycle [8]. Although Leach et al.
[17] did not find cyclin D1 gene amplification in a panel
of 47 CRC cell lines, its protein was overexpressed in
about 30% of CRC cases that were included in the studies
of Bartkova et al. [6] and Arber et al. [18]. In the former
study [6], cyclin D1 was aberrantly accumulated in a
significant subset of human CRC cases and the cell lines
derived from these cases were dependent on cyclin D1 in their
cell cycle progression. In the second study [18],
overexpression of cyclin D1 was detected in 30% of adenomatous
polyps, indicating that overexpression is a relatively early
event in colon carcinogenesis which is possibly responsible
for the pathological changes in the mucosa preceding
neoplastic transformation. More recently, Holland et al.
[19], Pasz-Walczak et al. [20] and Utsunomiya et al. [21]
reported up-regulation of cyclin D1 in 58.7%, 100% and
43% of their studied cases respectively. 
Table 4: Multivariate analysis of the relationship between survival 
and the tested markers 

Predictive variables                 HR        CI                 P 
Cyclin D1 (baseline < 6.1)           10.864    1.055-86.250       0.03* 
Cyclin A (baseline < 10.5)           13.686    1.012-190.579      0.0490* 
Positive lymph node metastasis       3.921     1.057-14.472       0.0410* 
Stage IV                             3.411     1.048-12.083       0.03* 
Depth of invasion (T3, T4)           5.408     0.449-65.080       0.1836 
Age (years) (≥50)                    1.996     0.678-5.878        0.2310 
Sex                                  0.910     0.315-2.358        0.8453 

* P value < 0.05 (significant) 

HR: Hazard Ratio 

CI: 95% confidence interval 
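
As a rough consistency check on Table 4, a Wald-type P value can be approximated from a hazard ratio and its 95% confidence interval. The sketch below assumes the interval was constructed symmetrically on the log scale, which the paper does not state; the function name `wald_p_from_hr` is illustrative and not part of the original analysis.

```python
import math

def wald_p_from_hr(hr, ci_low, ci_high, z_crit=1.959964):
    """Approximate two-sided P value from a hazard ratio and its 95% CI.

    Assumption: the CI is a Wald interval, symmetric on the log scale.
    """
    # Standard error of log(HR), recovered from the width of the log CI
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(hr) / se
    # Two-sided tail probability of the standard normal distribution
    return math.erfc(abs(z) / math.sqrt(2))

# Cyclin D1 and sex rows of Table 4
p_cyclin_d1 = wald_p_from_hr(10.864, 1.055, 86.250)
p_sex = wald_p_from_hr(0.910, 0.315, 2.358)
print(round(p_cyclin_d1, 3), round(p_sex, 3))
```

Under this assumption the approximated values fall close to the tabulated P values for these two rows; small differences are expected because the published intervals need not be exactly symmetric around log(HR).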



In the present study, up-regulation of cyclin D1 was 
detected in 68.3% of the cases. The SI was significantly 
higher in carcinomas than in normal colorectal mucosa, 
and in poorly-differentiated adenocarcinomas it was 
approximately twice that of other histological types. 
Amplification and/or overexpression of cyclin D1 significantly 
correlated with deeply invasive tumors and positive 
lymph node metastasis. Our results in this regard are 
consistent with previous studies [8,22]. In 2001, Holland et 
al. [19] demonstrated that deregulation of cyclin D1 and 
p21WAF1 proteins is important in colorectal tumorigenesis 
and has implications for patient prognosis. Similarly, 
McKay et al. [23] found that cyclin D1 was the only protein 
in their panel (cyclin D1, p53, p16, Rb-1, PCNA and p27) 
that correlated with improved outcome in CRC patients. 
However, a few studies failed to detect any correlation 
between cyclin D1 overexpression and the 
clinicopathological factors in CRC [6,18]. This controversy 
in results could partially be explained by differences 
in the sampling of studied cases. The present study 
included 24 cases of poorly differentiated adenocarcinoma, 
which is not common in other studies of CRC in 
western countries. This was possible because the majority 
of CRC cases diagnosed in Egypt are of high histological 
grade [3]. The correlation between cyclin D1 overexpression 
and high histological grade was also reported in 
other tumor types, including non-small cell lung 
carcinomas [24] and squamous cell carcinomas of the larynx 
[16]. Another possible explanation for the observed 
controversy in the results of different studies is the detection 
method used. 

In the present work, overexpression of cyclin D1 was more 
common than gene amplification of the PRAD-1/cyclin D1 
gene, with a 63.6% concordance. This was similarly 
reported by Bartakova et al. [6], who mentioned that there 
is a subset of CRC cases in which cyclin D1 is overexpressed 
without PRAD-1/cyclin D1 gene amplification. 
Consistent with this hypothesis are reports of elevated cyclin 
D1 mRNA levels and immunohistochemically detectable 
accumulation of the protein in over one third of breast 
cancer cases, at a frequency significantly higher than that 
deduced from DNA amplification studies [9,25]. These 
data imply that mechanisms other than gene 
amplification can also lead to deregulation and accumulation 
of cyclin D1 in solid tumors. 

So far, several studies have examined the prognostic 
significance of cyclin D1 overexpression in various carcinomas, 
including CRC [22]. However, these studies 
yielded conflicting results, which could be attributed to 
organ heterogeneity. In our study, patients with tumors 
that exhibited cyclin D1 overexpression tended to have 
poor prognosis. 

It was reported that patients with cyclin A positive carcinomas 
had significantly shorter median survival times. 
Handa et al. [8] were able to detect cyclin A overexpression 
in 77% of their CRC cases. They also demonstrated that 
cyclin A could be used as a prognostic factor of CRC. More 
recently, Habermann et al. [26] studied cases of ulcerative 
colitis with and without an associated adenocarcinoma 
for the presence of cyclin A overexpression. They found 
that cyclin A overexpression was higher in cases of ulcerative 
colitis with adenocarcinomas than in those without 
adenocarcinomas. Consequently, they concluded that 
cyclin A could be used for monitoring ulcerative colitis 
patients and for the early detection of an emerging carcinoma 
in this high risk group of patients. 

In our study, cyclin A was detected in 80% of the patients 
and Cox regression analysis showed that it could be used 
as a prognostic marker in CRC in addition to cyclin D1. 

It would have been useful if we had assessed the expression 
level of cyclin A by another technique (DNA 
amplification). This would have added more information 
regarding the gene status on the one hand and confirmed the 
results of IHC on the other hand. Unfortunately, this was 
not possible because in most of the cases included in the 
present work, the extracted DNA was not sufficient to 
study cyclin amplification after the assessment of cyclin D1. 

In 1996, Nagao et al. [11] reported that histone H3 labeling 
index significantly correlated with Ki-67 immunostaining 
and was high in poorly differentiated human 
hepatocellular carcinoma. This was similarly reported in 
the present work since we found a significant correlation 
between the SI of histone H3 and Ki-67. However, no 
statistically significant correlation was found between histone 
H3 SI and any of the studied clinicopathological 
factors. 

Although Ki-67 immunostaining reflects the proliferative 
activity of CRC, it has not been recognized as a significant 
prognostic factor in this type of tumor [27,28]. However, 
Suzuki et al. [29] found a significant correlation between 
Ki-67 labeling index and local invasion of CRC. In the 
present study there was a significant relationship between 
the SI of Ki-67, tumor size and grade. However, Kaplan-Meier 
survival curves showed no significant difference in 
survival rates between patients with and without overexpression 
of Ki-67. 

Conclusions 

Our results demonstrate that cyclin D1, cyclin A, histone H3 
and Ki-67 are overexpressed in a subset of CRC; however, 
only cyclin D1 and cyclin A overexpression correlates with 
poor differentiation and tumor progression. This indicates 
the superiority of cyclin A and cyclin D1 as indicators 
of poor prognosis compared to Ki-67 and histone H3 
mRNA in CRC. Cyclin A and D1 could therefore be considered 
significant, independent prognostic factors in CRC 
patients. These findings are especially important in stage 
II patients, since 25-30% of those patients have poor prognosis 
in spite of being node-negative. However, the standard 
clinicopathologic prognostic factors cannot identify 
this subset accurately, and therefore there is a great 
demand for more accurate, individually-based, biological 
prognostic parameters that help in detecting this high risk 
group of patients who can benefit from adjuvant therapy. 
If the findings of the present study are confirmed in a 
larger study, evaluation of cyclin A and D1 may be applicable 
to clinical management of CRC, allowing the identification 
of patients with poor prognosis. 
and immunohistochemical analyses 

In this study, we analysed gene amplification, RNA expression and protein expression of the c-myc gene on archival tissue specimens 
of high-grade human breast cancer, using fluorescent in situ hybridisation (FISH), nonradioactive in situ hybridisation and 
immunohistochemistry. The specific question that we addressed was whether expression of c-Myc mRNA and protein were 
correlated with its gene copy amplification, as determined by FISH. Although c-Myc is one of the most commonly amplified 
oncogenes in human breast cancer, few studies have utilised in situ approaches to directly analyse the gene copy amplification, RNA 
transcription and protein expression on human breast tumour tissue sections. We now report that by using the sensitive FISH 
technique, a high proportion (70%) of high-grade breast carcinomas were amplified for the c-myc gene, irrespective of status of the 
oestrogen receptor. However, the level of amplification was low, ranging between one and four copies of gene gains, and the majority 
(84%) of the cases with this gene amplification gained only one to two copies. Approximately 92% of the cases were positive for 
c-myc RNA transcription and essentially all demonstrated c-myc protein expression. In fact, a wide range of expression levels were 
detected. Statistically significant correlations were identified among the gene amplification indices, the RNA expression scores and 
protein expression scores. c-myc gene amplification, as detected by FISH, was significantly associated with expression of its mRNA, as 
measured by the intensity of in situ hybridisation in invasive cells (P = 0.0067), and by the percentage of invasive cells positive for 
mRNA expression (P = 0.0006). c-myc gene amplification was also correlated with the percentage of tumour cells which expressed 
high levels of its protein, as detected by immunohistochemistry in invasive cells (P = 0.0016). Thus, although multiple mechanisms are 
known to regulate normal and aberrant expression of c-myc, in this study, where in situ methodologies were used to evaluate 
high-grade human breast cancers, gene amplification of c-myc appears to play a key role in regulating expression of its mRNA and protein. 
British Journal of Cancer (2004) 90, 1612-1619. doi:10.1038/sj.bjc.6601703 www.bjcancer.com 
Published online 30 March 2004 
©2004 Cancer Research UK 

Keywords: c-myc breast cancer; gene amplification; gene expression 



The c-myc oncogene has been shown to be amplified and/or 
overexpressed in many types of human cancer (Marcu et al, 1992; 
Nass and Dickson, 1997; Nesbit et al, 1999; Liao and Dickson, 
2000). Numerous experiments in vivo have also causally linked 
aberrant expression of this gene to the development and 
progression of cancer in different body sites (Marcu et al, 1992; 
Nass and Dickson, 1997; Nesbit et al, 1999; Liao and Dickson, 
2000). However, several critical issues regarding the significance of 
c-myc in human cancer still remain obscure. First, even for a given 
type of malignancy, the frequencies of the alterations of c-myc at 
the cytogenetic and expression levels vary greatly from one report 
to another (Liao and Dickson, 2000). For instance, the frequencies 
of its amplification, mRNA and protein overexpression in breast 
cancer vary between 1-94, 22-95 and roughly 50-100%, 
respectively, among different reports (Liao and Dickson, 2000). 
Thus, it is still unclear to what extent this gene is altered at the 
cytogenetic level and at different expression levels in breast 
carcinoma. 

One controversial issue pertains to the prognostic value of c-myc 
gene alterations in cancer. The central role of c-Myc protein in 
accelerating cell proliferation, documented by many early studies, 
has led to a general concept for many types of cancer that 
amplification or overexpression of this gene may be associated 
with a more aggressive tumour and a poorer patient survival 
(Berns et al, 1992; Marcu et al, 1992; Sato et al, 1995; Nass and 
Dickson, 1997; Nesbit et al, 1999; Visca et al, 1999; Liao and 
Dickson, 2000). However, many reports have shown an opposite 
correlation (Sikora et al, 1985, 1987; Watson et al, 1986; Polacarz 
et al, 1989; Voravud et al, 1989; Williams et al, 1990; Melhem et al, 
1992; Pietilainen et al, 1995; Diebold et al, 1996; Smith and Goh, 
1996; Augenlich et al, 1997; Bieche et al, 1999), while other studies 
do not support either of these conclusions. For instance, gene 
amplification or overexpression of c-Myc protein has also been 
shown to associate with a better tumour differentiation or a better 
patient survival for cancer of the testis, ovary, bile ducts, colon and 
breast (Sikora et al, 1985, 1987; Watson et al, 1986; Polacarz et al, 
1989; Voravud et al, 1989; Williams et al, 1990; Melhem et al, 1992; 
Pietilainen et al, 1995; Diebold et al, 1996; Smith and Goh, 1996; 
Augenlich et al, 1997; Bieche et al, 1999). This controversy does 
not appear to be related completely to the cancer type, since both 
positive (Berns et al, 1992; Visca et al, 1999) and negative 
(Williams et al, 1990; Melhem et al, 1992; Pietilainen et al, 1995; 
Smith and Goh, 1996; Augenlich et al, 1997; Bieche et al, 1999) 
correlations have been reported for colon cancer and breast 
cancer. More interestingly, c-Myc overexpression has been shown 
to predict a poorer prognosis for cutaneous melanoma, but a 
favourable outcome for uveal melanoma (Grover et al, 1997; Chana 
et al, 1998a, b, 1999; Grover et al, 1999). These data indicate 
different roles of c-Myc, even in the same type of tumour, perhaps 
depending upon different tissue microenvironments. 

Another controversial issue concerns the nuclear-cytoplasmic 
localisation of c-Myc. Studies of neoplasms of the colon, testis, 
ovary and liver have shown that predominantly nuclear localisation 
of c-Myc tends to occur in benign lesions, while cytoplasmic 
localisation tends to occur in more malignant tumours (Sikora 
et al, 1985; Sundaresan et al, 1987; Melhem et al, 1992; Sasano et al, 
1992; Yuen et al, 2001). Whether these patterns of subcellular 
localisation of c-Myc tend to reflect the malignant status of breast 
cancer remains an enigma. 

A recent study of the impact of DNA amplification on gene 
expression patterns in breast cancer used mRNA and DNA from 14 
breast cancer cell lines. Analysis was conducted with a 13000 
cDNA clone array for gene expression measurement and a 
Comparative Genomic Hybridisation (CGH) microarray for gene 
copy number measurements. This study also included known 
breast cancer genes, such as c-myc, HER-2/neu and aib1 (Hyman 
et al, 2002). Interestingly, 44% of the most highly amplified genes 
were also overexpressed at the mRNA level. Consistent with this 
pattern, c-Myc gene copy number and its expression levels showed 
a statistically significant (α = 0.020) correlation in this microarray 
study of breast cancer cell lines. Another study, by Pollack and 
colleagues, used microarray analysis and BAC array CGH of RNA 
and DNA (respectively) extracted from intermediate grade human 
breast tissues, and tested for amplification and expression of 
c-Myc (among other genes). This study demonstrated that two out of 
37 specimens were both amplified and overexpressed, while others 
were either amplified or overexpressed, but not both. The authors 
of this study suggested that contaminating stromal tissue may 
compress the fluorescence ratios, leading to underestimates of gene 
amplification and overexpression (Pollack et al, 2002). 

To more clearly address the importance of gene amplification 
and expression of c-Myc in human breast cancer, we used in situ 
methodologies, which can clearly distinguish stromal and carcinoma 
components. We studied the amplification and overexpression 
of the c-myc gene with fluorescent in situ hybridisation 
(FISH), non-radioactive in situ hybridisation (ISH) and 
immunohistochemical (IHC) approaches on paraffin-embedded biopsy 
sections of untreated, high-grade breast cancer. It was observed 
that 70, 92 and 70% of the cancer cases exhibited c-myc gene 
amplification, its mRNA overexpression and its protein 
overexpression, respectively. In most of the cases (84%) that showed 
gene amplification, the c-myc gene gained only one to two copies, 
which is consistent with c-myc FISH data from other studies. 
Unlike some oncogenes, such as N-myc, which typically demonstrates 
gene amplification copy numbers of greater than 10 in 
neuroblastoma, and HER-2/neu (Sartelet et al, 2002), whose copy 
numbers range up to 14-40 in breast carcinomas (Isola et al, 
1999), gene copy numbers of c-myc are not as greatly increased. In 
the study noted earlier, using breast cancer cell line CGH array and 
cDNA microarray expression analysis, it was demonstrated that 
the most dramatically increased expression levels were associated 
with large gene copy number increases, although low-level gains 
and losses had a significant influence on gene expression 
dysregulation (Hyman et al, 2002). Only one study has been 
published (Pollack et al, 2002) that has begun to determine if these 
findings are directly relevant to actual human breast tumour 
tissues, since many of the genetic changes in tissue culture cell 
lines are more extreme than those displayed in primary tumour 
material. Furthermore, the relationships among gene amplification, 
mRNA expression and c-Myc protein expression were not 
explored in prior human breast cancer cell line and tumour tissue 
studies (Hyman et al, 2002; Pollack et al, 2002). 

In our human breast tumour tissue study, a high correlation was 
found between c-myc FISH and ISH, for both percentage of 
staining (P<0.0067) and intensity of positive cells (P<0.0006). In 
addition, c-myc gene copy amplification by FISH was correlated 
with c-Myc protein expression positive cells by IHC (P<0.0016). 
These results support the idea that c-Myc overexpression of both 
mRNA and protein is related to the copy number of the c-myc 
DNA amplification. We show in this study that amplification and 
overexpression of c-Myc occur with high frequency in high-grade 
human breast cancer tissues. 



MATERIALS AND METHODS 
Materials 

Formalin-fixed, paraffin-embedded tissue blocks of breast carcinoma 
and normal breast tissue were obtained from the 
Histopathology and Tissue Shared Resource at the Lombardi 
Comprehensive Cancer Center (LCCC), at Georgetown University 
Medical Center. The criteria for tumour selection were the 
following: negative progesterone receptor status, metastases to 
axillary lymph nodes and high grade (Elston Score >7). The 
oestrogen receptor status of the tumours was known from archived 
pathology reports. The parameters were chosen from our prior 
meta-analysis (Deming et al, 2000), as indications of a high 
likelihood of c-myc gene amplification. Normal breast tissue 
specimens were from reduction mammoplasty. Serial sections 
(5 µm) for FISH, ISH and IHC were prepared by the LCCC 
Histopathology and Tissue Shared Resource. 

FISH 

A dual-label FISH technique was used (Jenkins et al, 1997). Slides 
were baked overnight at 60°C to assure adherence of the sample. 
Tissue sections were deparaffinised with two successive, 10 min 
xylene washes, and then dehydrated in a graded ethanol series of 
70, 80 and 95% at room temperature. Samples were then digested 
with 4% pepsin (Sigma, St Louis, MO, USA) at 45°C for 10 min. 
DNA probes used were an alpha satellite probe to chromosome 8, 
labelled with biotin, and a c-myc probe, labelled with digoxigenin 
(Ventana, Tucson, AZ, USA). Codenaturation was performed at 
90°C for 10 min on a hot plate. Hybridisation was at 37°C for 
12-16 h. Detection of signals was accomplished with an antiavidin 
antibody labelled with Texas Red, and an antidigoxigenin antibody 
conjugated to fluorescein (Ventana, Tucson, AZ, USA). Slides were 
postwashed in 2 x SSC at 72°C for 5 min and counterstained with 
DAPI to visualise cell nuclei. Results were viewed and quantified 
with a Zeiss Axiophot fluorescence microscope, equipped with 
appropriate filters and an Applied Imaging Cytovision system 
(Pittsburgh, PA, USA). In this approach, the c-myc unique 
sequence probe was visualised as a green signal and the control 
probe for the chromosome 8 centromere was red, thus easily being 
distinguished when scored. 

One serial section from each tumour sample was stained with 
haematoxylin and eosin and first reviewed by a pathologist (BS), to 
help identify the tumour area of the section. This procedure 
ensured that the tumour cells, but not the normal cells, were 
counted. Nuclei of up to 50 tumour cells were scored from each 
FISH-stained section, independently by two investigators. 
Hybridisation signals were averaged, and the amplification index was 
presented as the number of c-myc signals divided by the number of 
chromosome 8 centromere signals. A 1.8-fold increase was used as 
the criterion to judge the presence of c-myc gene amplification. 
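
The amplification index described above can be sketched as follows. This is a minimal illustration, assuming signals are tallied per nucleus and that an index equal to the 1.8 cut-off counts as amplified; the function names and the example counts are hypothetical and are not study data.

```python
def amplification_index(myc_signals, cen8_signals):
    """Total c-myc signals divided by total chromosome 8 centromere signals.

    With the same nuclei in both lists, the ratio of totals equals the
    ratio of the per-nucleus averages.
    """
    total_cen8 = sum(cen8_signals)
    if total_cen8 == 0:
        raise ValueError("no centromere signals counted")
    return sum(myc_signals) / total_cen8

def is_amplified(index, cutoff=1.8):
    """Apply the 1.8-fold criterion (inclusive cut-off is an assumption)."""
    return index >= cutoff

# Hypothetical signal counts for five tumour nuclei (illustrative only)
myc = [4, 5, 3, 4, 4]
cen8 = [2, 2, 2, 2, 2]
idx = amplification_index(myc, cen8)
print(idx, is_amplified(idx))
```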

In situ hybridisation 

In situ hybridisation (ISH) was carried out with a nonradioactive 
method, described previously (Liao et al, 2000a, b). One serial 
section from each specimen was hybridised overnight at 60°C with 
riboprobes that were in vitro transcribed from the antisense or 
sense strand of an approximately 300 bp cDNA of human c-myc 
(ATCC, Manassas, VA, USA), labelled with digoxigenin-conjugated 
UTP. The sections were then incubated with an antibody against 
digoxigenin, followed by incubation with a second antibody 
conjugated to alkaline phosphatase. The signal was visualised by 
colour development with 5-bromo-4-chloro-3-indolyl phosphate 
and nitroblue tetrazolium. All reagents were purchased from 
Boehringer Mannheim, Indianapolis, IN. To control the signal 
specificity, two serial sections were mounted on the same slide for 
hybridisation with the antisense and sense probes, respectively. 
ISH was given intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. Intensity scores were assigned 0, 1, 2 and 3, and 
percentage scores were assigned as 1 = 1-25, 2 = 26-50, 3 = 51-75 
and 4 = 76-100%. 
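
The percentage scoring scheme can be expressed as a small lookup. This is a sketch only: the function name is illustrative, assigning a score of 0 when no cells stain is an assumption, and so is the handling of fractional percentages between the stated bands.

```python
def percentage_score(percent_positive):
    """Map the percentage of positive cells to a 1-4 percentage score."""
    if not 0 <= percent_positive <= 100:
        raise ValueError("percentage must lie between 0 and 100")
    if percent_positive < 1:
        return 0  # assumption: no positive cells gives a score of 0
    if percent_positive <= 25:
        return 1  # 1-25%
    if percent_positive <= 50:
        return 2  # 26-50%
    if percent_positive <= 75:
        return 3  # 51-75%
    return 4      # 76-100%

scores = [percentage_score(p) for p in (0, 10, 26, 51, 76, 100)]
print(scores)
```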

Immunohistochemistry 

Immunohistochemical staining (IHC) was performed using an 
avidin-biotin complex (ABC) method described previously (Liao 
et al, 1998). One serial section of each specimen was deparaffinised 
and blocked with 3% peroxide. Antigens were retrieved by heating 
slides in a microwave oven in 50 mM citrate buffer, pH 6.4, at 
boiling temperature, for 12 min. After blocking with 6% normal 
goat serum, the section was incubated with a mouse monoclonal 
antibody to human c-Myc (9E10, Sigma Chemical Company, St 
Louis, MO, USA) at 1:100 dilution for 2 h, followed by 1 h 
incubation with a second antibody conjugated with biotin (Vector 
Laboratories Inc., Burlingame, CA, USA). The section was then 
incubated with peroxidase-conjugated avidin (Dako Corporation, 
Carpinteria, CA, USA) for 30 min, followed by colour development 
with diaminobenzidine and peroxide. All procedures were carried 
out at room temperature. To control the signal specificity, serial 
sections from 10 tumour samples were also stained using an 
alternate c-Myc antibody (C19 from Santa Cruz Biotechnology 
Inc., Santa Cruz, CA, USA) at 1:60 dilution. This antibody resulted 
in focally positive staining in the tumour, but the staining intensity 
was weaker. To control the signal specificity, serial sections were 
made from five selected positive cases which were subjected to the 
same staining procedure, with a normal mouse IgG to replace the 
c-Myc antibody. This control staining did not give rise to a signal, 
demonstrating the specificity of the c-Myc antibody signal. IHC 
staining was given an intensity and percentage score based upon 
the intensity of positive staining and number of cells staining. 
Intensity scores were assigned 0, 1, 2 and 3 and percentage scores 
were assigned as 1 = 1-25, 2 = 26-50, 3 = 51-75 and 4 = 76-100%. 
Determinations were made of cellular localisation of c-Myc 
antibody staining to cytoplasm and/or nucleus in normal and 
invasive cells within each breast tumour specimen. 

Statistical analyses 

For each analysis of gene copy amplification (FISH), mRNA 
expression (ISH) and protein expression (IHC), all cases were first 
grouped as positive or negative to calculate the percentages of 
positive cases and negative cases, as described (Zar, 1974). Fisher's 
exact test was used to compare percentages, and two-sample t-test 
or Wilcoxon rank test was used to compare average scores. Both 
ISH and IHC were given intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. As noted earlier, intensity scores were assigned 0, 1, 
2 and 3 and percentage scores were assigned as 1 = 1-25, 2 = 26-50, 
3 = 51-75 and 4 = 76-100%. A score of >2 for either intensity of 
staining or percentage of cells positive by ISH was assigned as 
high. For IHC, an intensity score of >1 was assigned as high and a 
percentage score of >3 was categorised as high. Each amplification 
index was paired with its corresponding mRNA expression 
score to calculate the correlation coefficient r. The same method was 
used to estimate the association of the amplification indices with the 
c-Myc protein expression levels, and the association of the mRNA 
expression levels with the protein expression levels. A P-value of 
0.05 or less was used to determine the statistical significance in all 
analyses. In all, 54 pairs of normal vs invasive tissues were analysed 
using McNemar's χ² test to determine if there was a difference in 
cellular localisation of c-Myc antibody signal to nuclear or 
cytoplasmic compartments. 



RESULTS 

FISH analysis of gene amplification 

Amplification of the c-myc gene was measured by a FISH test in 46 
cases of breast cancer; Figure 1 demonstrates cells with no 
amplification (one copy of c-myc/one copy of chromosome 8 
centromere) and a moderate amplification (a 3/1 ratio). Amplification 
was calculated by the number of c-myc signals divided by the 
number of chromosome 8 alpha satellite signals. A 1.8-fold 
increase cut-off was used to judge gene amplification. As shown 
in Table 1, 32 out of 46 (70%) cases were gene amplified for c-myc, 
whereas only 30% (14/46) of the cases showed amplification 
indices lower than the cut-off value. The amplification indices for 
most (84%, or 27/32) cases with gene amplification ranged 
between 1.8- and three-fold, indicating that the locus gained up to 
two copies of c-myc in the majority of the cases. The percentage of 
cases with gene gains of three copies or higher was 11% (five out of 
46) of total cases analysed, or near 16% (five out of 32) of the cases 
with gene amplification, including one case (2% of total cases or 
3% of the cases with gene amplification) with the highest index of 5 
(a gain of four copies). 
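
The percentages quoted above follow directly from the case counts reported with Table 1. The short check below recomputes them; the dictionary keys and the helper `pct` are illustrative names only.

```python
# Case counts per amplification-index category, as reported with Table 1
counts = {"1.0-1.7": 14, "1.8-1.99": 9, "2.0-2.9": 18, ">=3.0": 5}

total = sum(counts.values())            # all cases analysed by FISH
amplified = total - counts["1.0-1.7"]   # cases at or above the 1.8 cut-off
low_level = counts["1.8-1.99"] + counts["2.0-2.9"]  # index between 1.8 and 3

def pct(numerator, denominator):
    """Percentage rounded to the nearest whole number."""
    return round(100 * numerator / denominator)

summary = {
    "amplified / total": pct(amplified, total),
    "not amplified / total": pct(counts["1.0-1.7"], total),
    "low-level / amplified": pct(low_level, amplified),
    ">=3.0 / total": pct(counts[">=3.0"], total),
    ">=3.0 / amplified": pct(counts[">=3.0"], amplified),
}
print(total, amplified, summary)
```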

In all, 28 of the breast carcinomas in this study were ER 
negative, and 14 were ER positive. The average c-myc gene 
amplification score was 1.896 (s.e. = 0.196) for ER positive and 
2.201 (s.e. = 0.157) for ER negative. Although ER-negative tumours 
had a slightly higher average c-myc score, the difference was not 
statistically significant (two-sided P = 0.252 from two-sample t-test 
and 0.251 from Wilcoxon rank test), consistent with the results of 
our prior meta-analysis of the literature (Deming et al, 2000). 
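
The two-sample t-test can be approximately reproduced from the summary statistics alone. The sketch below assumes that "s.e." denotes the standard error of the mean, that the group sizes are the 14 and 28 cases stated above, and that a pooled-variance (equal variance) test was used; none of these details is confirmed by the text, and the function names are illustrative.

```python
import math

def pooled_t_from_summary(mean1, se1, n1, mean2, se2, n2):
    """Pooled-variance t statistic and degrees of freedom from means and SEMs."""
    sd1 = se1 * math.sqrt(n1)   # recover standard deviations from the SEMs
    sd2 = se2 * math.sqrt(n2)
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se_diff = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / se_diff, df

def t_two_sided_p(t, df, steps=2000):
    """Two-sided P value via Simpson integration of the Student t density."""
    t = abs(t)
    norm = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))

    def pdf(x):
        return norm * (1 + x * x / df) ** (-(df + 1) / 2)

    h = t / steps  # steps must be even for Simpson's rule
    area = pdf(0) + pdf(t)
    for i in range(1, steps):
        area += (4 if i % 2 else 2) * pdf(i * h)
    area *= h / 3               # probability mass between 0 and |t|
    return 1 - 2 * area

# ER negative (n = 28) versus ER positive (n = 14)
t_stat, dof = pooled_t_from_summary(2.201, 0.157, 28, 1.896, 0.196, 14)
p_value = t_two_sided_p(t_stat, dof)
print(round(t_stat, 3), dof, round(p_value, 3))
```

Under these assumptions the result lands close to the reported two-sided P value, which supports reading "s.e." as the standard error of the mean.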



In situ hybridisation analysis of c-myc mRNA expression 

A total of 51 breast cancer samples were studied for c-Myc mRNA 
expression, with nonradioactive in situ hybridisation (ISH). ISH 
results were assigned intensity and percentage scores based upon 
signal intensity of positive staining and number of cells staining 
within the sample, respectively. As shown in Table 2, 86% (44 out 
of 51) of tumours were scored as high in intensity, and 92% (47 out of 
51) had more than 51% positive cells, also considered as highly 
increased c-Myc expression. mRNA expression was heterogeneous 
in the breast tumour tissue, and no morphologic subtype was 
predominant in the high or low categories. One case showed no 
c-Myc ISH staining. In 79% (38/48) of cases, epithelia in normal 
mammary glands adjacent to the tumour also showed a high 
Figure 1 FISH analysis of c-myc amplification in tumour cells from breast tumour tissue sections. The FISH probe for the human c-myc unique sequence is seen as 
green, while the normal control signal, a centromeric probe for chromosome 8, is shown in red. The nuclei of tumour cells were visualised by DAPI 
counterstaining. (A) 1:1 copy ratio of c-myc to chromosome 8 (c-myc/8 centromere), indicating no amplification of c-myc in tumour cells. (B) 3:1 copy 
ratio of c-myc to chromosome 8 (c-myc/8 centromere), a moderate amplification of the c-myc gene. 



Table 1 c-myc gene copy amplification analysis by FISH in poor 
prognosis human breast tumour samples 

Amplification index                        Percentage of samples with 
(#c-myc signals/#control signals)*         FISH ratios in each category 
1.0-1.7                                    30% (14 out of 46) 
1.8-1.99                                   20% (9 out of 46) 
2.0-2.9                                    39% (18 out of 46) 
≥3.0                                       11% (5 out of 46) 

Analysis was conducted on 46 individual paraffin-embedded tissue samples with 
negative progesterone receptor status, positive lymph node involvement and high 
tumour grade. * Normal control ratio is 1. 



intensity of staining. In three cases, no staining was seen in the 
normal terminal duct lobular units. Figure 2 shows representative 
fields of high, medium and low c-myc mRNA expression levels in 
invasive ductal carcinoma samples. 

Association of FISH and ISH 

c-Myc scores were dichotomised as binary variables (high or low), 
and a score of 2 or higher was categorised as high on ISH. A score 
higher than the median was categorised as high from FISH studies. 
These dichotomised scores are depicted in Table 3. A Fisher's exact 
test was performed for comparing binary responses to see if there 
was any association between FISH and ISH. It was found that the 
FISH score was significantly associated with percentage of staining 
in the invasive cells (P = 0.0067, two-sided McNemar's test) and 
also with the intensity score on ISH (P = 0.0006, two-sided). 
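
For readers unfamiliar with the test, a two-sided Fisher's exact test on a dichotomised 2 x 2 table can be sketched as below. This is a generic textbook implementation, not the authors' software; the function name is illustrative and the example counts are a hypothetical table, not the study's Table 3.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    return sum(prob(x) for x in range(lo, hi + 1)
               if prob(x) <= p_obs * (1 + 1e-9))

# Hypothetical counts: rows = FISH high/low, columns = ISH high/low
p_fisher_demo = fisher_exact_two_sided(3, 1, 1, 3)
print(p_fisher_demo)
```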



Immunohistochemical staining of c-Myc proteins 

In total, 51 breast carcinomas, which were subjected to FISH 
analysis, and all of which also had been analysed for c-myc mRNA 
by in situ hybridisation, were also analysed for the expression of 
c-Myc protein, using immunohistochemical staining with the 9E10 
antibody. IHC results were assigned an intensity and percentage 
score based on intensity of positive staining and number of cells 
staining, respectively. Intensity scores were assigned 0, 1, 2 and 3 
and percentage scores were assigned as 0, 1 = 0-25, 2 = 26-50, 
3 = 51-75 and 4 = 76-100. For IHC, an intensity score of >1 was 
assigned as high and a percentage score of >3 was categorised as 
high. Figure 2 shows examples of high, medium and low levels of 
c-myc antibody staining in invasive ductal carcinoma samples. In 34 
cases, normal tissue was seen; 30 of these showed cytoplasmic 
staining and 22 had nuclear staining in terminal ductal lobular 
units. In all, 12 cases showed 1+, 14 cases 2+ and four cases 3+ 
cytoplasmic staining. In situ hybridisation revealed positive 
staining in 46 out of 49 cases with normal tissue. Seven cases 
showed 1+, 13 cases showed 2+ and 26 cases showed 3+ 
staining by ISH. Both immunohistochemistry and in situ 
hybridisation showed diffuse positivity in adipocytes. 

Table 4 shows the staining pattern for the cohort In all, 70% (36 
out of 51) of cases showed high intensity of staining for c-Myc 
protein, while 85% (29 out of 34) of cases with detectable staining 
had more than 76% positive cells, also considered as high 
expression. To verify the staining specificity, serial sections from 
10 tumour specimens that were positive for 9E10 antibody were 
also stained using the CI 9 rabbit polyclonal anti-c-Myc antibody. 
Results revealed a staining pattern similar to 9E10. However, the 
staining intensity with CI 9 was weaker than 9E10. The specificity 
of these two antibodies was verified by Western blots in previous 
studies (Persons et a/, 1997; Liao et al, 2000b). Figure 2 shows 
results of c-Myc in situ hybridisation and immunohistochemistry 
studies on samples considered to demonstrate low, moderate and 
high levels of c-Myc expression. Analysis of c-Myc protein
localisation in the nuclear and cytoplasmic compartments
of normal and invasive cells within the tumours revealed that
nuclear staining was positive in 41% of normal cells, compared to
22% of invasive cells (statistically significant at P = 0.01 by
McNemar's two-sided χ² test). The increase in relative cytoplasmic
localisation of c-Myc protein from normal (53.7%) to
invasive cells (61.1%) was not significant. Thus, the data
are consistent with partial exclusion of c-Myc from the nuclei of 
invasive breast cancer cells. 
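As a rough check on the paired comparison above, McNemar's statistic can be computed from the discordant pairs alone. The sketch below uses the discordant counts from Table 5(A): 14 pairs positive in normal cells only and 4 positive in invasive cells only. It assumes the uncorrected χ² form with one degree of freedom. The paper does not state which variant was used, so the result need not match the reported P exactly.

```python
import math


def mcnemar_chi2(b, c):
    """Uncorrected McNemar chi-square for discordant pair counts b and c."""
    if b + c == 0:
        raise ValueError("at least one discordant pair is required")
    statistic = (b - c) ** 2 / (b + c)
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


stat, p = mcnemar_chi2(14, 4)
print(f"chi2 = {stat:.2f}, two-sided p = {p:.3f}")
```

With these counts the two-sided p-value falls between 0.01 and 0.02, the same order as the value quoted in the text.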

The FISH score was significantly associated with the percentage 
positivity of invasive cells, as seen on IHC studies of c-Myc. 
However, 40% of tumours displayed a low index of c-myc gene 
amplification, but still expressed high levels of c-Myc protein 
(Table 6), indicating the possibility of other mechanisms of
overexpression unrelated to gene amplification in at least some
tumours. The FISH score was not significantly associated with 
the intensity of IHC staining in the invasive cells (not shown), in 
contrast to the IHC percentage positivity score.

Table 2 c-myc mRNA in situ hybridisation (ISH) results

Staining intensity                               0     1     2     3
Number of tumour samples in each category
(N = 51)                                         1     6    25    19

Percent positivity                               1     2     3     4
Number of tumour samples in each level category
(N = 51)                                         1     3     5    42

In all, 51 human high-grade breast carcinomas were analysed to determine the relationships between c-Myc mRNA expression and c-myc gene in situ hybridisation results. Data
are shown in two ways in the above table. First, overall staining intensity of c-Myc-positive cells was scored as 0, 1, 2, 3 (low to high), and the number of tumour samples at each
level of staining is indicated on the line below. Next, the percentage of tumour cells staining was scored as 1, 2, 3, 4 (low to high %, as discussed in Materials and Methods). The
number of tumours at each level of percent cell positivity for c-Myc is then indicated on the line below.









Figure 2 Immunohistochemical staining and in situ hybridisation for c-Myc of three sets of invasive ductal carcinoma. (A, C and E) High (3+),
intermediate (2+) and low (1+) level of staining by immunohistochemistry for c-Myc. (B, D and F) High (3+), intermediate (2+) and low (1+) level of
staining by in situ hybridisation.



DISCUSSION 

Although there have been many reports on c-myc amplification in
human breast cancer (Liao and Dickson, 2000), there are only two
published studies involving application of the FISH technique to
unfixed, frozen sections (Persons et al, 1997; Visscher et al, 1997),
and one prior study using FISH on an archival human tissue
microarray (Schraml et al, 1999). Another recent study applied
FISH to evaluate c-myc amplification in ductal carcinoma in situ
(DCIS) (Aulmann et al, 2002). Using the FISH technique on
formalin-fixed, paraffin-embedded sections, we now show that
70% of high-grade breast cancer samples bear c-myc gene copy
amplifications. Interestingly, the above-mentioned study, using
FISH and focusing on DCIS, detected amplification of c-myc in
only 20% of cases, but found a correlation of c-myc with increased
tumour size and proliferation (Aulmann et al, 2002).

The level of amplification of c-myc in our study ranged between
one and four additional copies of the gene; the majority (84%) of
the cases with the gene amplification gained only one to two
copies, also consistent with FISH data reported for c-myc copy
amplification in human metastatic prostate carcinoma tissues
(Jenkins et al, 1997). The relationship between the level of c-myc
gene copy amplification and the level of its increased mRNA
expression has been examined previously in breast cancer cell
lines (Hyman et al, 2002). In general, it has been concluded that
the two scores coordinate for c-myc, as is the case for many breast
cancer genes. However, only 44% of the highly amplified genes, in
general, showed increased RNA expression, and only 10.5% of the
highly overexpressed genes were gene copy-amplified in the cell
line study (Hyman et al, 2002). Another analysis was conducted to
study the relationships between gene amplification and expression
of 6095 genes in 37 intermediate grade human breast tumours.
This study demonstrated that 62% of the highly amplified genes
also showed elevated expression; overall, a two-fold change in
DNA copy number was associated with a 1.5-fold change in mRNA
levels. Overall, 12% of the variation in gene expression in the
breast tumours studied was associated with gene copy number
variation (Pollack et al, 2002). Further study of additional human
breast tumours, at precisely defined grades and stages, will be
necessary in order to more fully define the relationships between
DNA copy numbers and expression of genes. The studies we report
here indicate higher levels of c-Myc gene amplification and
expression than previous reports in breast cancer. We
believe that this is probably the result of our analysis of individual
tumour cells in a well-defined set of high-grade breast tumours.
Prior c-Myc expression and amplification microarray studies used
tumour specimens which contain normal stromal components,



Table 3 Correlations between c-myc gene copy number (FISH) and mRNA
expression (ISH)

FISH 



Low 



High 



(A) ISH (% cells)

Low 

High 



(B) ISH (intensity) 
Low 



I 

\9 



2 

18 



3 
id 

P = 0.0067 



5 
id 

P = 0.0006 



Serial sections of high-grade human breast carcinomas were scored for c-myc gene
copy number (FISH, Table 1) and mRNA expression (ISH, Table 2). In (A), a positive
correlation (P = 0.0067) was observed between tumour samples with a high
percentage of cells demonstrating mRNA expression and a high c-myc gene copy
number. A score of 2 or higher was classified as high on ISH, and a score of median or
greater was categorised as high on FISH. In (B), a positive correlation (P = 0.0006)
was shown between a high level of intensity for c-Myc RNA expression and a high
c-myc gene copy number. Note that a pairwise comparison of FISH and ISH was not
possible for all cases, due to incomplete overlap of cases analysed with each assay.



potentially underestimating amplification and expression levels of
the invasive tumour components (Pollack et al, 2002).

Our study reports a percentage of tumours gene amplified for c-myc
(using FISH in high-grade tumours) that is much higher than
the average figure (15.5%) reported in the literature (Isola et al,
2002). Most of the prior studies have employed the relatively
insensitive Southern blot technique, and were reviewed in a
recent meta-analysis (Deming et al, 2000). Consistent with this
prior literature background, a recent study of 94 lobular and
ductal breast cancers assessed amplification of c-myc by
using a semiquantitative PCR assay and protein expression, with


Table 5 Nuclear/cytoplasmic localisation of c-Myc comparing normal
and invasive cells

                                   Invasive cells (frequency)
Normal cells (frequency)             -         +       Total

(A) Nuclear localisation
  -                                 28         4         32
  +                                 14         8         22
  Total                             42        12         54

(B) Cytoplasmic localisation
  -                                 12        13         25
  +                                  9        20         29
  Total                             21        33         54

In all, 54 pairs (normal vs invasive) of tissues were analysed to answer the questions of
(1) whether positivity of nuclear cells in normal tissues is different from that in
invasive cells, and similarly (2) whether positivity of cytoplasmic cells in normal tissues
is different from that in invasive cells. The data are summarised in the above
contingency tables. In all, 22 normal cell specimens were positive for c-Myc staining
(40.7%), compared to 12 specimens (22.2%) in invasive cells. The difference is
statistically significant (P = 0.01) by McNemar's χ² test (two-sided).



Table 6 Correlation between c-Myc protein expression (IHC) and c-myc
gene copy number (FISH)

                         FISH
IHC (% cells)        Low      High
  Low                  3         0
  High                10        15

P = 0.0016

Consecutive serial sections of high-grade human breast tumours were scored for c-myc
gene copy number or protein expression, by immunohistochemistry (IHC). IHC
scores were defined in the Materials and methods section. Data were analysed for
correlations between the results. A highly significant correlation was observed
between high c-Myc protein expression (IHC, percent cells positive) and
high c-myc gene amplification (FISH). P = 0.0016 from two-sided McNemar's test.
Note that for 15 cases, no staining for c-Myc could be detected; these negative cases
were not included in the correlation presented above.
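One way to read the 40% figure quoted in the text is as the share of tumours with high IHC percentage scores that nevertheless had a low FISH score. The sketch below recomputes that share from the Table 6 counts. The dictionary layout and key names are illustrative assumptions, and this is one plausible reading of how the figure was derived, not a calculation the authors confirm.

```python
# Table 6 counts keyed by (IHC percent-positivity class, FISH class).
table6 = {
    ("low", "low"): 3,
    ("low", "high"): 0,
    ("high", "low"): 10,
    ("high", "high"): 15,
}

# Tumours with high protein expression, regardless of gene copy number.
high_ihc_total = table6[("high", "low")] + table6[("high", "high")]

# Fraction of those tumours that lacked gene amplification.
fraction_low_fish = table6[("high", "low")] / high_ihc_total

print(f"{fraction_low_fish:.0%} of high-IHC tumours had a low FISH score")
```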



Table 4 c-Myc immunohistochemistry (IHC) results

Staining intensity                               0     1     2     3
Number of tumour samples in each category       15    13    20     3

Percent positivity                               1     2     3     4
Number of tumour samples in each category        2     2     1    29

In all, 51 high-grade human breast carcinomas were analysed to determine the relationships between c-Myc protein expression and c-myc gene in situ hybridisation results. Data
are shown in two ways in the above table. First, overall staining intensity of c-Myc-positive cells was scored as 0, 1, 2, 3 (low to high), and the number of tumour samples at each
level of staining is indicated on the line below. Next, in a random subset of these cases, the percentage of tumour cells staining was scored as 0, 1, 2, 3, 4 (low to high %, as
discussed in Materials and methods). The number of tumours at each level of percent cell positivity for c-Myc is indicated on the line below.

densitometry, after Western blot. These data showed c-myc gene
amplification in 21% of tumours (Jenkins et al, 1997), using assays
not based on in situ discrimination of tumour vs nontumour cells.
The lower frequency of c-myc amplification in this prior study is in contrast with
the data we present here, and could be the result of the higher
sensitivity and precision of the FISH and immunohistochemical
methods, as distinct from quantitative PCR and Western blot
densitometry. In addition, the 70% of amplified tumours in our
study is also much higher than the 12% reported by Schraml et al
(1999), using a c-myc FISH test on a tissue microarray. This large
difference may be because the arrays are prepared from cores of
paraffin-embedded tissue as small as 0.6 mm in diameter, which
may contain too few tumour cells for complete analysis of
amplification of a gene such as c-myc. c-myc is known to be
quite heterogeneous in its gene amplification within individual
tumours (in contrast to HER2/neu, for example) (Persons et al,
1997).

Most previous reports on the expression of c-myc mRNA have 
utilised Northern blot, dot blot or PCR-based approaches, while 
just a few involved in situ hybridisation, which were primarily 
performed on frozen tissue sections (Liao and Dickson, 2000). 
Normal breast tissue is dominated by adipose cells, differing 
greatly from tumour tissue in its epithelial cellularity. Thus, 
normal and tumour tissues may not be rigorously compared by 
techniques involving RNA extraction from total tissue. Therefore, 
conclusions such as 'increased expression' may be more difficult to
make from studies with Northern blot, dot blot and PCR-based 
techniques that require RNA extraction from tissues that have not 
been fastidiously micro-dissected for selection of tumour cells. 
Using a more sensitive, nonradioactive in situ hybridisation (ISH) 
approach on formalin-fixed, paraffin-embedded sections, we 
report herein high expression of c-myc mRNA in 92% of high- 
grade breast carcinomas. This figure is much higher than the 
recently reported data (22%), obtained by using a real-time RT- 
PCR method (Bieche et al, 1999). Dilution of the RNA from 
epithelium by the RNA from adipose in normal breast tissue in this 
latest prior report may be one of the possible explanations for this 
large difference. 

In conclusion, the present study shows that approximately 70, 92
and 70% of biopsies of untreated high-grade breast cancer exhibit
c-myc gene amplification, mRNA overexpression and protein
overexpression, respectively. In most cases (84%) with gene copy
amplification, the c-myc gene gains one to two additional copies.
c-myc gene amplification was significantly associated with expression
of its mRNA (both by intensity in invasive cells and by
percentage positivity in invasive cells), and with expression of its
protein (by percentage positivity in invasive cells). However, our
data were also consistent with the prior literature on c-Myc
(reviewed in Nass and Dickson, 1997; Liao and Dickson, 2000),
indicating complex transcriptional, post-transcriptional, translational
and post-translational control of c-Myc expression in vitro.
Specifically, in Table 6 we observed that in 40% of the high-grade
tumours tested, c-Myc protein was expressed at high levels, despite
a lack of its gene amplification.

It will be interesting to analyse lower grade tumours and 
premalignant lesions, with the same measurement tools, to 
determine if this c-myc amplification pattern is different, 
comparing different steps in onset and progression of the 
disease. Specifically, prior studies in fibroblasts and in human 
mammary epithelial cells (Liao et al, 1998, 2000a, b) have 
demonstrated that only a subtle deregulation of expression of c- 
Myc is sufficient to allow genomic instability. These prior cell 
biologic findings raise the question of whether c-Myc protein 
expression precedes or follows its gene amplification during the 
course of the natural history of breast cancer. It will also be 
interesting for future studies of lower grade breast cancers and 
premalignant lesions to determine whether there is evidence of 
nuclear exclusion of c-Myc protein. Indeed, nuclear exclusion 
of c-Myc in high-grade tumours could serve to attenuate its
functions in later stages of disease progression (Liao and Dickson, 
2000).

If these minor cell proteins differ among cells to the same extent as the
abundant proteins, as is commonly assumed, only a small number of protein
differences (perhaps several hundred) suffice to create very large differences
in cell morphology and behavior.

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level.

Other cell types respond to glucocorticoids in different ways. In fat cells, for
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization— different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression of different sets of 
genes. 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 4 

If differences between the various cell types of an organism depend on the
particular genes that the cells express, at what level is the control of gene expression
exercised? There are many steps in the pathway leading from DNA to protein, and
all of them can in principle be regulated. Thus a cell can control the proteins it
makes by (1) controlling when and how often a given gene is transcribed
(transcriptional control), (2) controlling how the primary RNA transcript is spliced or
otherwise processed (RNA processing control), (3) selecting which completed
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport
control), (4) selecting which mRNAs in the cytoplasm are translated by ribosomes
(translational control), (5) selectively destabilizing certain mRNA molecules in
the cytoplasm (mRNA degradation control), or (6) selectively activating,
inactivating, or compartmentalizing specific protein molecules after they have been
made (protein activity control) (Figure 9-2).

For most genes transcriptional controls are paramount. This makes sense
because, of all the possible control points illustrated in Figure 9-2, only
transcriptional control ensures that no superfluous intermediates are synthesized. In the
following sections we discuss the DNA and protein components that regulate the
initiation of gene transcription. We return at the end of the chapter to the other
ways of regulating gene expression.

Figure 9-2 Six steps at which
eucaryote gene expression can be
controlled. Only controls that operate
at steps 1 through 5 are discussed in
this chapter. The regulation of protein
activity (step 6) is discussed in
Chapter 5; this includes reversible
activation or inactivation by protein
phosphorylation as well as
irreversible inactivation by proteolytic
degradation.

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control.



DNA-binding Motifs in Gene 
Regulatory Proteins 5 

How does a cell determine which of its thousands of genes to transcribe? As dis-
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 

Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics 6 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in proteins acting as specific repressors for these
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a series of standard
chromatography columns (see pp. 166-169). Once isolated, the proteins
were shown to bind to specific DNA sequences close to the genes that they

Figure 9-3 Double-helical structure
of DNA. The major and minor grooves
on the outside of the double helix are
indicated. The atoms are colored as
follows: carbon, dark blue; nitrogen,
light blue; hydrogen, white; oxygen,
red; phosphorus, yellow.

Figure 9-71 A mechanism to explain
both the marked deficiency of CG
sequences and the presence of CG
islands in vertebrate genomes. A
black line marks the location of an
unmethylated CG dinucleotide in the
DNA sequence, while a red line marks
the location of a methylated CG
dinucleotide.
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Summary 

The many types of cells in animals and plants are created largely through mechanisms
that cause different genes to be transcribed in different cells. Since many specialized
animal cells can maintain their unique character when grown in culture, the
gene regulatory mechanisms involved in creating them must be stable once established
and heritable when the cell divides, endowing the cell with a memory of its
developmental history. Procaryotes and yeasts provide unusually accessible model
systems in which to study gene regulatory mechanisms, some of which may be relevant
to the creation of specialized cell types in higher eucaryotes. One such mechanism
involves a competitive interaction between two (or more) gene regulatory proteins,
each of which inhibits the synthesis of the other; this can create a flip-flop
switch that switches a cell between two alternative patterns of gene expression. Direct
or indirect positive feedback loops, which enable gene regulatory proteins to
perpetuate their own synthesis, provide a general mechanism for cell memory.
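The flip-flop switch described above can be illustrated with a toy simulation in which two proteins each repress the synthesis of the other. Everything here is an assumption made for illustration: the rate equations, the parameter values (`alpha`, `n`), the Euler integration step and the function name do not come from the text.

```python
def simulate_switch(x0, y0, alpha=5.0, n=2, dt=0.01, steps=5000):
    """Integrate two mutually repressing proteins with simple Euler steps.

    Each protein is made at a rate that falls as the other accumulates
    and is lost at a rate proportional to its own level.
    """
    x, y = x0, y0
    for _ in range(steps):
        dx = alpha / (1 + y ** n) - x
        dy = alpha / (1 + x ** n) - y
        x += dt * dx
        y += dt * dy
    return x, y


# A small initial advantage for one protein decides which state is reached.
print(simulate_switch(2.0, 0.1))
print(simulate_switch(0.1, 2.0))
```

Under these assumed parameters, whichever protein starts ahead ends up dominant and holds the other at a low level. The two stable outcomes correspond to the two alternative patterns of gene expression.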

In eucaryotes gene transcription is generally controlled by combinations of gene
regulatory proteins. It is thought that each type of cell in a higher eucaryotic organism
contains a specific combination of gene regulatory proteins that ensures the expression
of only those genes appropriate to that type of cell. A given gene regulatory protein
may be expressed in a variety of circumstances and typically is involved in the
regulation of many genes.

In addition to diffusible gene regulatory proteins, inherited states of chromatin
condensation are also utilized by eucaryotic cells to regulate gene expression. In
vertebrates DNA methylation also plays a part, mainly as a device to reinforce decisions
about gene expression that are made initially by other mechanisms.



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant
form of regulation for most genes, other controls can act later in the pathway
from RNA to protein to modulate the amount of gene product that is made. Although
these posttranscriptional controls, which operate after RNA polymerase
has bound to the gene's promoter and begun RNA synthesis, are less common
than transcriptional control, for many genes they are crucial. It seems that every
step in gene expression that could be controlled in principle is likely to be regulated
under some circumstances for some genes.

We consider the varieties of posttranscriptional regulation in temporal order,
according to the sequence of events that might be experienced by an RNA
molecule after its transcription has begun (Figure 9-72).
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Figure 9-72 Possible post- 
transcriptional controls on gene 
expression. Only a few of these 
controls are likely to be used for any 
one gene. 
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Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A is 
transcribed and translated much more 
efficiently than gene B. This allows the
amount of protein A in the cell to be 
much greater than that of protein B.

FROM DNA TO RNA

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment — most obviously by controlling 
the production of its RNA. 
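The two-stage amplification described above is simple multiplication: protein output is the number of RNA copies times the number of proteins made from each copy. The sketch below makes this explicit. The function name and all of the numbers are hypothetical values chosen only to show the arithmetic.

```python
def protein_output(rna_copies, proteins_per_rna):
    """Total protein molecules made from one gene (two-stage amplification)."""
    return rna_copies * proteins_per_rna


# Hypothetical efficiencies for an abundantly and a weakly expressed gene.
gene_a = protein_output(rna_copies=1000, proteins_per_rna=100)
gene_b = protein_output(rna_copies=2, proteins_per_rna=5)

print(gene_a, gene_b, gene_a // gene_b)
```

Because the two efficiencies multiply, modest differences at each stage compound into a large difference in final protein amounts.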



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from
DNA chemically in two respects: (1) the nucleotides in RNA are
ribonucleotides— that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 
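The standard pairing rules in the paragraph above (G with C, and A in DNA pairing with U in RNA) can be sketched as a lookup table that builds the RNA strand complementary to a DNA strand. The function and variable names are illustrative, and the sketch ignores non-standard pairs such as G with U.

```python
# RNA base that pairs with each DNA base on the strand being copied.
DNA_TO_RNA_PAIR = {"A": "U", "T": "A", "G": "C", "C": "G"}


def complementary_rna(dna_strand):
    """Return the RNA sequence that base-pairs with the given DNA strand.

    The DNA strand is read 3' to 5', so the RNA is returned 5' to 3'.
    """
    try:
        return "".join(DNA_TO_RNA_PAIR[base] for base in dna_strand.upper())
    except KeyError as err:
        raise ValueError(f"not a DNA base: {err.args[0]}") from None


print(complementary_rna("TACGGT"))
```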

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com-
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 

Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change in a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the self-infectious nature of the protein aggregation that is central to prion diseases. PrP is highly unusual 
because the misfolded version of the protein, called PrP*, induces the normal PrP protein it contacts to
change its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by
the overproduction of a variant protein that is especially prone to aggregation, but because this structure is
not infectious in this way, it cannot spread from one animal to another. (C) Drawing of a cross-beta filament,
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases.
Because the hydrogen-bond interactions in a β sheet form between polypeptide backbone atoms (see Figure
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several
possible models for the conversion of PrP to PrP*, showing the likely change of two α-helices into four
β-strands. Although the structure of the normal protein has been determined accurately, the structure of the
infectious form is not yet known with certainty because the aggregation has prevented the use of standard
structural techniques. (C, courtesy of Louise Serpell, adapted from M. Sunde et al. J. Mol. Biol. 273:729-739,
1997; D, adapted from S.B. Prusiner, Trends Biochem. Sci. 21:482-487, 1996.)

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain. 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance" observed
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of
their proteins according to their needs. In principle, any or all of the steps in
Figure 6-90 could be regulated by the cell for each individual protein. However, as
we shall see in Chapter 7, the initiation of transcription is the most common
point for a cell to regulate the expression of each of its genes. This makes sense,
inasmuch as the most efficient way to keep a gene from being expressed is to
block the very first step: the transcription of its DNA sequence into an RNA
molecule.



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The
amino acids used for protein synthesis are first attached to a family of tRNA
molecules, each of which recognizes, by complementary base-pair interactions,
particular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in
the mRNA is then read from one end to the other in sets of three according to the
genetic code.

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs, each bearing a
specific amino acid, bind sequentially to the appropriate codon in mRNA by forming
complementary base pairs with the tRNA anticodon. Each amino acid is added to the
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
Figure 7-5 Six steps at which
eucaryotic gene expression can be
controlled. Controls that operate at
steps 1 through 5 are discussed in this
chapter. Step 6, the regulation of protein
activity, includes reversible activation or
inactivation by protein phosphorylation
(discussed in Chapter 3) as well as
irreversible inactivation by proteolytic
degradation (discussed in Chapter 6).
Gene Expression Can Be Regulated at Many of the Steps
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and determining where in the cytosol they are localized (RNA transport and
localization control), (4) selecting which mRNAs in the cytoplasm are translated
by ribosomes (translational control), (5) selectively destabilizing certain mRNA
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple be regulated, for most genes the initiation of RNA transcription is the most
important point of control.
How does a cell determine which of its thousands of genes to transcribe? As
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con-
trolled by a regulatory region of DNA relatively near the site where transcription
begins. Some regulatory regions are simple and act as switches that are thrown
by a single signal. Many others are complex and act as tiny microprocessors,
responding to a variety of signals that they interpret and integrate to switch the 
neighboring gene on or off. Whether complex or simple, these switching devices 
occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of
the DNA in vertebrate germ cells is inactive and highly methylated. Over long
periods of evolutionary time, the methylated CG sequences in these inactive
regions have presumably been lost through spontaneous deamination events
that were not properly repaired. However, promoters of genes that remain active
in the germ cell lineages (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired. Such regions are preserved in modern day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 
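
The overall deficiency of CG sequences, and their clustering in islands, can be made concrete with a small calculation. The Python sketch below compares how often the CG dinucleotide occurs in a sequence with how often it would be expected from the C and G content alone. The function name `cpg_obs_exp` and both example sequences are hypothetical, and the observed/expected ratio is a commonly used measure rather than one defined in this chapter.

```python
def cpg_obs_exp(seq: str) -> float:
    """Return the observed/expected ratio of CG dinucleotides in seq.

    The expected count assumes that C and G occur independently:
    expected = (number of C) * (number of G) / (sequence length).
    """
    seq = seq.upper()
    n = len(seq)
    c = seq.count("C")
    g = seq.count("G")
    if n == 0 or c == 0 or g == 0:
        return 0.0  # ratio is undefined; report 0 when C or G is absent
    observed = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    expected = c * g / n
    return observed / expected


# Hypothetical sequences chosen only for illustration.
island_like = "CGCGGCGCCGCGTACGCGCG"   # CG-rich, as in an unmethylated island
depleted = "CATGCTGACCTGGCAGCTGA"      # contains C and G but no CG dinucleotide

print(cpg_obs_exp(island_like))
print(cpg_obs_exp(depleted))
```

A ratio well below 1 corresponds to the CG-depleted bulk of the genome described above, whereas a ratio near or above 1 is what one would look for in a candidate CG island.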

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks.
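
The flip-flop switch mentioned above can be illustrated with a minimal numerical sketch in Python. It assumes a simple model that is not taken from this chapter: each of two regulatory proteins, A and B, is made at a rate that falls as the other accumulates, and each is lost at a rate proportional to its own level. The parameter values (`alpha`, `n`, `dt`, `steps`) and the function name `simulate_toggle` are illustrative assumptions.

```python
def simulate_toggle(a0, b0, alpha=4.0, n=2, dt=0.01, steps=5000):
    """Euler-integrate two proteins that each inhibit the other's synthesis.

    Assumed model (illustrative only):
        dA/dt = alpha / (1 + B**n) - A
        dB/dt = alpha / (1 + A**n) - B
    """
    a, b = a0, b0
    for _ in range(steps):
        da = alpha / (1.0 + b ** n) - a
        db = alpha / (1.0 + a ** n) - b
        a += dt * da
        b += dt * db
    return a, b


# The same "cell" started with a slight excess of A or of B
# settles into opposite, self-maintaining states.
print(simulate_toggle(3.0, 0.5))  # A ends high, B ends low
print(simulate_toggle(0.5, 3.0))  # B ends high, A ends low
```

In this sketch, which of the two patterns persists depends only on the starting levels, which is the sense in which such a switch gives the cell a memory.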

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell. A given gene regu-
latory protein may be active in a variety of circumstances and typically is involved
in the regulation of many genes.

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 
Figure 7-86 A mechanism to explain
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in vertebrate 
genomes. A black line marks the location 
of a CG dinucleotide in the DNA 
sequence, while a red "lollipop" indicates 
the presence of a methyl group on the 
CG dinucleotide. CG sequences that lie in 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained in 
evolution. Methylated CG sequences, on 
the other hand, tend to be lost through 
deamination of 5-methyl C to T, unless the
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscriptional
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
many genes they are crucial. 
Regulation of transcription



The phenotypic differences that distinguish the various kinds of cells in a
higher eukaryote are largely due to differences in the expression of genes
that code for proteins, that is, those transcribed by RNA polymerase II. In
principle, the expression of these genes might be regulated at any one of
several stages. The concept of the "level of control" implies that gene
expression is not necessarily an automatic process once it has begun. It could
be regulated in a gene-specific way at any one of several sequential steps. We
can distinguish (at least) five potential control points, forming the series:

Activation of gene structure
↓
Initiation of transcription
↓
Processing the transcript
↓
Transport to cytoplasm
↓
Translation of mRNA

The existence of the first step is implied by the discovery that genes may
exist in either of two structural conditions. Relative to the state of most of
the genome, genes are found in an "active" state in the cells in which they
are expressed (see Chapter 27). The change of structure is distinct from the
act of transcription, and indicates that the gene is "transcribable." This
suggests that acquisition of the "active" structure must be the first step in
gene expression.

Transcription of a gene in the active state is controlled at the stage of
initiation, that is, by the interaction of RNA polymerase with its promoter.
This is now becoming susceptible to analysis in the in vitro systems (see
Chapter ...). For most genes, this is a major control point; probably it is
the most common level of regulation.

There is at present no evidence for control at subsequent stages of
transcription in eukaryotic cells, for example, via antitermination
mechanisms.

The primary transcript is modified by capping at the 5' end, and usually also
by polyadenylation at the 3' end. Introns must be spliced out from the
transcripts of interrupted genes. The mature RNA must be exported from the
nucleus to the cytoplasm. Regulation of gene expression by selection of
sequences at the level of nuclear RNA might involve any or all of these stages,
but the one for which we have most evidence concerns changes in splicing: some
genes are expressed by means of alternative splicing patterns whose regulation
controls the type of protein product (see Chapter ...).

Finally, the translation of an mRNA in the cytoplasm can be specifically
controlled. There is little evidence for the employment of this mechanism in
adult somatic cells, but it does occur in some embryonic situations, as
described in Chapter ... The mechanism is presumed to involve the blocking of
initiation of translation of some mRNAs by specific protein factors.

But having acknowledged that control of gene expression can occur at multiple
stages, and that production of RNA cannot inevitably be equated with production
of protein, it is clear that the overwhelming majority of regulatory events
occur at the initiation of transcription. Regulation of tissue-specific gene
transcription lies at the heart of eukaryotic differentiation; indeed, we see
examples in Chapter 38 in which proteins that regulate embryonic development
prove to be transcription factors. A regulatory transcription factor serves to
provide common control of a large number of target genes, and we seek to answer
two questions about this mode of regulation: what identifies the common target
genes to the transcription factor; and how is the activity of the transcription
factor itself regulated in response to intrinsic or extrinsic signals?



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing groups of genes under common
control is that they share a promoter element that is recognized by a
regulatory transcription factor. An element that causes a gene to respond to
such a factor is called a response element; examples are the HSE (heat shock
response element), GRE (glucocorticoid response element), and SRE (serum
response element).

The properties of some inducible transcription factors and the elements that
they recognize are summarized in Table 29.1. Response elements have the same
general characteristics as upstream elements of promoters or enhancers. They
contain short consensus sequences, and copies of the response elements found in
different genes are closely related, but not necessarily identical. The region
bound by the factor extends for a short distance on either side of



Table 29.1 Inducible transcription factors bind to response elements that
identify groups of promoters or enhancers subject to coordinate control.

Regulatory Agent   Module   Consensus          Factor
Heat shock         HSE      CNNGAANNTCCNNG     HSTF
Glucocorticoid     GRE      TGGTACAAATGTTCT    Receptor
Phorbol ester      TRE      TGACTCA            AP1
Serum              SRE      CCATATTAGG         SRF



the consensus sequence. In promoters, the elements are not present at fixed
distances from the startpoint, but are usually <200 bp upstream of it. The
presence of a single element usually is sufficient to confer the regulatory
response, but sometimes there are multiple copies.
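
The idea that a short consensus sequence marks the genes under common control can be sketched in Python. The example below takes three of the consensus sequences listed in Table 29.1, treats N as "any nucleotide", and searches the 200 bp upstream of a startpoint for exact matches. The function names and the test sequence are hypothetical, and exact matching is a simplification: as noted above, real copies of an element are closely related to the consensus but not necessarily identical.

```python
import re

# Consensus sequences as listed in Table 29.1; N stands for any nucleotide.
RESPONSE_ELEMENTS = {
    "HSE": "CNNGAANNTCCNNG",
    "TRE": "TGACTCA",
    "SRE": "CCATATTAGG",
}


def consensus_to_regex(consensus):
    """Turn a consensus string into a compiled regex, mapping N to [ACGT]."""
    return re.compile("".join("[ACGT]" if ch == "N" else ch for ch in consensus))


def find_elements(upstream, window=200):
    """Find exact consensus matches in the last `window` bases of `upstream`.

    `upstream` is assumed to end immediately before the startpoint.
    Returns (element name, distance in bp from the startpoint to the
    first base of the match) for each non-overlapping match.
    """
    region = upstream.upper()[-window:]
    hits = []
    for name, consensus in RESPONSE_ELEMENTS.items():
        for match in consensus_to_regex(consensus).finditer(region):
            hits.append((name, len(region) - match.start()))
    return hits


# Hypothetical upstream region containing one sequence that fits the HSE consensus.
upstream = "A" * 30 + "CTAGAAGCTCCAGG" + "T" * 50
print(find_elements(upstream))
```

A more realistic search would tolerate mismatches or score each site against a weight matrix; the exact-match version is kept here only because it shows the principle in a few lines.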

Response elements may be located in promoters or in enhancers. Some types of
elements are typically found in one rather than the other; usually an HSE is
found in a promoter, while a GRE is found in an enhancer. We assume that all
response elements function by the same general principle. A gene is regulated
by a sequence at the promoter or enhancer that is recognized by a specific
protein. The protein functions as a transcription factor needed for RNA
polymerase to initiate. Active protein is available only under conditions when
the gene is to be expressed; its absence means that the promoter is not
activated by this particular [...].

An example of a situation in which many genes are controlled by a single factor
is provided by the heat shock response. This is common to a wide range of
prokaryotes and eukaryotes and involves multiple controls of gene expression:
an increase in temperature turns off transcription of some genes, turns on
transcription of the heat shock genes, and causes changes in the translation of
mRNAs. The control of the heat shock genes illustrates the differences between
prokaryotic and eukaryotic modes of control. In bacteria, a new sigma factor is
synthesized that directs RNA polymerase holoenzyme to recognize ...
Background: Prostate stem cell antigen (PSCA) is a recently defined homologue of the Thy-1/Ly-6 family of
glycosylphosphatidylinositol (GPI)-anchored cell surface antigens. The purpose of the present study was to
examine the expression status of PSCA protein and mRNA in clinical specimens of human prostate cancer (PCa)
and to validate it as a potential molecular target for diagnosis and treatment of PCa.

Materials and Methods: Immunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA
expression were simultaneously performed on paraffin-embedded sections from 20 benign prostatic hyperplasia
(BPH), 20 prostatic intraepithelial neoplasm (PIN) and 48 prostate cancer (PCa) tissues, including 9 androgen-
independent prostate cancers. The level of PSCA expression was semiquantitatively scored by assessing both the
percentage and intensity of PSCA-positive staining cells in the specimens. We then compared PSCA expression
between BPH, PIN and PCa tissues and analysed the correlations of PSCA expression level with pathological grade,
clinical stage and progression to androgen-independence in PCa.

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative and less intense
and uniform than that seen in HGPIN and PCa. There was moderate to strong PSCA protein and mRNA
expression in 8 of 11 (72.7%) HGPIN and in 40 of 48 (83.4%) PCa specimens examined by IHC and ISH analyses,
with statistical significance compared with BPH (20%) and low grade PIN (22.2%) samples (p < 0.05, respectively).
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen-
independence (p < 0.05, respectively). In addition, IHC and ISH staining showed a high degree of correlation
between PSCA protein and mRNA overexpression.

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of
human PCa. PSCA expression correlates positively with adverse tumor characteristics, such as increasing
pathological grade (poor cell differentiation), worsening clinical stage and androgen-independence, and
speculatively with prostate carcinogenesis. PSCA protein overexpression results from upregulated transcription
of PSCA mRNA. PSCA may have prognostic utility and may be a promising molecular target for diagnosis and
treatment of PCa.
Introduction

Prostate cancer (PCa) is the second leading cause of cancer-related death in
American men and is becoming increasingly common in China. Despite recent
great progress in the diagnosis and management of localized disease, there
continues to be a need for new diagnostic markers that can accurately
discriminate between indolent and aggressive variants of PCa. There also
continues to be a need for the identification and characterization of potential
new therapeutic targets on PCa cells. Current diagnostic and therapeutic
modalities for recurrent and metastatic PCa have been limited by a lack of
specific target antigens of PCa.

Although a number of prostate-specific genes have been identified (e.g.
prostate specific antigen, prostatic acid phosphatase, glandular kallikrein 2),
the majority of these are secreted proteins not ideally suited for many
immunological strategies. The identification of new cell surface antigens is
therefore critical to the development of new diagnostic and therapeutic
approaches to the management of PCa.

Reiter RE et al [1] reported the identification of prostate stem cell antigen
(PSCA), a cell surface antigen that is predominantly prostate specific. The
PSCA gene encodes a 123 amino acid glycoprotein, with 30% homology to stem cell
antigen 2 (Sca-2). Like Sca-2, PSCA is a member of the Thy-1/Ly-6 family and is
anchored by a glycosylphosphatidylinositol (GPI) linkage. mRNA in situ
hybridization (ISH) localized PSCA expression in normal prostate to the basal
cell epithelium, the putative stem cell compartment of prostatic epithelium,
suggesting that PSCA may be a marker of prostate stem/progenitor cells.

In order to examine the status of PSCA protein and mRNA expression in human PCa
and validate it as a potential diagnostic and therapeutic target for PCa, we
used immunohistochemistry (IHC) and in situ hybridization (ISH) simultaneously,
and conducted PSCA protein and mRNA expression analyses in paraffin-embedded
tissue specimens of benign prostatic hyperplasia (BPH, n = 20), prostate
intraepithelial neoplasm (PIN, n = 20) and prostate cancer (PCa, n = 48).
Furthermore, we evaluated the possible correlation of PSCA expression level
with PCa tumorigenesis, grade, stage and progression to androgen-independence.

Materials and methods 
Tissue samples 

All of the clinical tissue specimens studied herein were obtained from 80
patients aged 57-84 years by prostatectomy, transurethral resection of the
prostate (TURP) or biopsy. The patients were classified as 20 cases of BPH, 20
cases of PIN and 40 cases of primary PCa, including 9 patients with recurrent
PCa and a history of androgen ablation therapy (orchiectomy and/or hormonal
therapy), who were referred to as androgen-independent prostate cancers. Eight
specimens were harvested from these androgen-independent PCa patients prior to
androgen ablation treatment. Each tissue sample was cut into two parts: one was
fixed in 10% formalin for IHC and the other was treated with 4%
paraformaldehyde/0.1 M PBS, pH 7.4, in 0.1% DEPC for 1 h for ISH analysis, and
both were then embedded in paraffin. All paraffin blocks examined were then cut
into 5 μm sections and mounted on glass slides specific for IHC and ISH
respectively in the usual fashion. An H&E-stained section of each PCa was
evaluated and assigned a Gleason score by an experienced urological pathologist
at our institution based on the Gleason scoring criteria [2]. The Gleason sums
are summarized in Table 1. Clinical staging was performed according to the
Jewett-Whitmore-Prout staging system, as shown in Table 2. In the category of
PIN, we graded the specimens into two groups, i.e. low grade PIN (grade I-II)
and high grade PIN (HGPIN, grade III), on the basis of the literature [3,4].

Immunohistochemical (IHC) analysis
Briefly, tissue sections were deparaffinized, dehydrated, and subjected to
microwaving in 10 mmol/L citrate buffer, pH 6.0 (Boshide, Wuhan, China) in a
900 W oven for 5 min to induce epitope retrieval. Slides were allowed to cool
at room temperature for 30 min. A primary mouse antibody specific to human PSCA
(Boshide, Wuhan, China) at a 1:100 dilution was incubated with the slides at
room temperature for 2 h. Labeling was detected by sequentially adding
biotinylated secondary antibodies and streptavidin-peroxidase, and localized
using the 3,3'-diaminobenzidine reaction. Sections were then counterstained
with hematoxylin. Substitution of the primary antibody with phosphate-buffered
saline (PBS) served as a negative-staining control.

mRNA in situ hybridization (ISH) 

Five-μm-thick tissue sections were deparaffinized and dehydrated, then digested
in pepsin solution (4 mg/ml in 3% citric acid) for 20 min at 37.5 °C, and
further processed for ISH. Digoxigenin-labeled sense and antisense human PSCA
RNA probes (obtained from Boshide, Wuhan, China) were hybridized to the
sections at 48 °C overnight. The high-stringency posthybridization wash was
performed sequentially at 37 °C in 2 × standard saline citrate (SSC) for 10
min, in 0.5 × SSC for 15 min and in 0.2 × SSC for 30 min. The slides were then
incubated with biotinylated mouse anti-digoxigenin antibody at 37.5 °C for 1 h
followed by washing in 1 × PBS for 20 min at room temperature, and then with
streptavidin-peroxidase at 37.5 °C for 20 min followed by washing in 1 × PBS
for 15 min at room temperature. Subsequently, the slides
were developed with diaminobenzidine and then coun- 
Table 1: Correlation of PSCA expression with Gleason score

                 Intensity × frequency
Gleason score    0-6 (%)      9 (%)
2-4              5 (83)       1 (17)
5-7              19 (79)      5 (21)
8-10             5 (28)       13 (72)




Table 2: Correlation of PSCA expression with clinical stage

                 Intensity × frequency
Tumor stage      0-6 (%)      9 (%)
A-B              27 (67.5)    13 (32.5)
C-D              2 (25)       6 (75)



terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. All slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA immunostaining and
mRNA in situ hybridization, the same scoring method was used in the present
study for PSCA protein staining by IHC and PSCA mRNA staining by ISH. Each
slide was read and scored independently by two experienced urological
pathologists using Olympus BX-41 light microscopes. The evaluation was done in
a blinded fashion. For each section, five areas of similar grade were analyzed
semiquantitatively for the fraction of cells staining. Fifty percent of
specimens were randomly chosen and rescored to determine the degree of
interobserver and intraobserver concordance. There was greater than 95% intra-
and interobserver agreement.

The intensity of PSCA expression evaluated microscopically was graded on a
scale of 0 to 3+ with 3 being the highest expression observed (0, no staining;
1+, mildly intense; 2+, moderately intense; 3+, severely intense). The staining
density was quantified as the percentage of cells staining positive for PSCA
with the primary antibody or hybridization probe, as follows: 0 = no staining;
1 = positive staining in <25% of the sample; 2 = positive staining in 25%-50%
of the sample; 3 = positive staining in >50% of the sample. Intensity score (0
to 3+) was multiplied by the density score (0-3) to give an overall score of
0-9 [1,5]. In this way, we were able to differentiate specimens that may have
had focal areas of increased staining from those that had diffuse areas of
increased staining [6]. The overall score for each specimen was then
categorically assigned to one of the following groups: 0 score, negative
expression; 1-2 scores, weak expression; 3-6 scores, moderate expression; 9
score, strong expression.
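
The scoring scheme described above can be written out as a short Python sketch. The function names are hypothetical and the code is only an illustration of the stated rules, not the software (if any) used in the study; the thresholds and category labels are those given in the text.

```python
def density_score(percent_positive: float) -> int:
    """Map the percentage of positively staining cells to a 0-3 density score."""
    if not 0 <= percent_positive <= 100:
        raise ValueError("percentage must lie between 0 and 100")
    if percent_positive == 0:
        return 0          # no staining
    if percent_positive < 25:
        return 1          # <25% of the sample
    if percent_positive <= 50:
        return 2          # 25%-50% of the sample
    return 3              # >50% of the sample


def composite_score(intensity: int, density: int) -> int:
    """Multiply the intensity score (0-3) by the density score (0-3)."""
    if intensity not in range(4) or density not in range(4):
        raise ValueError("intensity and density must each be 0, 1, 2 or 3")
    return intensity * density


def expression_category(score: int) -> str:
    """Assign a composite score to one of the four expression groups."""
    if score == 0:
        return "negative"
    if score in (1, 2):
        return "weak"
    if score in (3, 4, 6):
        return "moderate"
    if score == 9:
        return "strong"
    raise ValueError("not a product of two scores in the range 0-3")


# Example: moderately intense (2+) staining in 60% of cells.
score = composite_score(2, density_score(60))
print(score, expression_category(score))
```

Because the overall score is a product of two values between 0 and 3, only 0, 1, 2, 3, 4, 6 and 9 can occur, which is why the categories skip from 6 to 9.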

Statistical analysis 

Intensity and density of PSCA protein and mRNA expression in BPH, PIN and PCa
tissues were compared using the chi-square test and Student's t-test.
Univariate associations between PSCA expression and Gleason score, clinical
stage and progression to androgen-independence were calculated using Fisher's
Exact Test. For all analyses, p < 0.05 was considered statistically significant.

Results 

PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative (composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 
2) in one and negative (composite score 0) in the other.
PSCA expression was localized to the basal and secretory
epithelial cells, and prostatic stroma stained almost entirely
negative for PSCA protein and mRNA in all cases
examined.

PSCA expression in PIN 

In this study, we detected weak or negative expression of PSCA protein and mRNA
(scores ≤2) in 7 of 9 (77.8%) low grade PIN and in 2 of 11 (18.2%) HGPIN, and
moderate expression (3-6 scores) in the remaining 2 low grade PIN and 5 of 11
(45.5%) HGPIN. One HGPIN with moderate PSCA mRNA expression (6 score) showed
weak staining for PSCA protein (2 score) by IHC. Strong PSCA protein and mRNA
expression (9 score) was detected in the remaining 3 of 11 (27.3%) HGPIN. There
was a statistically significant difference in PSCA protein and mRNA expression
levels between HGPIN and BPH (p < 0.05), but the difference between low grade
PIN and BPH did not reach statistical significance (p > 0.05).

PSCA expression in PCa

In order to determine if PSCA protein and mRNA can be detected in prostate
cancers and if PSCA expression levels are increased in malignant compared with
benign glands, forty-eight paraffin-embedded PCa specimens were analysed by IHC
and ISH. It was shown that 19 of 48 (39.6%) PCa samples stained very strongly
for PSCA protein and mRNA with a score of 9 and another 21 (43.8%) specimens
displayed moderate staining with scores of 4-6 (Figure 1). In addition, 4
specimens with moderate to strong PSCA mRNA expression (scores of 4-9) had weak
protein staining (a score of 2) by IHC analyses. Overall, PCa expressed a
significantly higher level of PSCA protein and mRNA than any other specimen
category in this study (p < 0.05, compared with BPH and PIN respectively). The
result demonstrates that PSCA protein and mRNA are overexpressed by a majority
of human PCa.

Correlation of PSCA expression with Gleason score in PCa

Using the semi-quantitative scoring method as described in Materials and
Methods, we compared the expression level of PSCA protein and mRNA with Gleason
grade of PCa, as shown in Table 1. Prostate adenocarcinomas were graded by
Gleason score as 2-4 scores = well-differentiation, 5-7 scores =
moderate-differentiation and 8-10 scores = poor-differentiation [7].
Seventy-two percent of Gleason scores 8-10 prostate cancers had very strong
staining of PSCA compared to 21% with Gleason scores 5-7 and 17% with 2-4
respectively, demonstrating that poorly differentiated PCa had significantly
stronger expression of PSCA protein and mRNA than moderately and well
differentiated tumors (p < 0.05). As depicted in Figure 1, IHC and ISH analyses
showed that PSCA protein and mRNA expression in several cases of poorly
differentiated PCa were particularly prominent, with more intense and uniform
staining. The results indicate that PSCA expression increases significantly
with higher tumor grade in human PCa.

Correlation of PSCA expression with clinical stage in PCa

The results for PSCA expression by PCa stage are shown in Table 2. Seventy-five
percent of locally advanced and node positive cancers (i.e. C-D stages)
expressed high levels of PSCA versus 32.5% of those that were organ confined
(i.e. A-B stages) (p < 0.05). The data demonstrate that PSCA expression
increases significantly with advanced tumor stage in human PCa.
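
The stage comparison can be checked with a short Python sketch of a two-sided Fisher's exact test, the test named under Statistical analysis. The function below is a plain implementation written for illustration, not the software used in the study. The 2 × 2 counts are read from Table 2 on the assumption, based on the percentages quoted in the text, that the first row (27 and 13) is stages A-B and the second row (2 and 6) is stages C-D.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the probabilities of all tables with the same margins that are
    no more likely than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2

    def weight(x):
        # Number of ways to obtain x in the top-left cell with fixed margins.
        return comb(row1, x) * comb(row2, col1 - x)

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    observed = weight(a)
    extreme = sum(
        weight(x) for x in range(lowest, highest + 1) if weight(x) <= observed
    )
    return extreme / comb(n, col1)


# Rows: stage A-B, stage C-D. Columns: composite score 0-6, composite score 9.
p = fisher_exact_two_sided(27, 13, 2, 6)
print(f"p = {p:.4f}")
```

Working with integer counts of tables rather than floating-point probabilities avoids rounding problems when deciding which tables are "at least as extreme" as the observed one.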

Correlation of PSCA expression with androgen-
independent progression of PCa

All 9 specimens of androgen-independent prostate cancers stained positive for
PSCA protein and mRNA. Eight specimens were obtained from patients managed
prior to androgen ablation therapy. Seven of eight (87.5%) of these
androgen-independent prostate cancers were in the strongest staining category
(score = 9), compared with three out of eight (37.5%) of patients with
androgen-dependent cancers (p < 0.05). The results demonstrate that PSCA
expression increases significantly with progression to androgen-independence
of human PCa.

It is evident from the results above that within a majority 
of human prostate cancers the level of PSCA protein and 
mRNA expression correlates significantly with increasing 
grade, worsening stage and progression to androgen-inde- 
pendence. 

Correlation of PSCA immunostainlng and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the
results of PSCA IHC staining with mRNA ISH analysis.
Positive staining areas and their intensity and density scores
evaluated by IHC were identical to those seen by ISH in 79
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/48
PCa, respectively). Importantly, 27/27 samples with
PSCA mRNA composite scores of 0-2, 32/36 samples
with scores of 3-6 and 22/24 samples with a score of 9
also had PSCA protein expression scores of 0-2, 3-6 and
9, respectively. However, in 5 samples with PSCA mRNA
overall scores of 3-6 and in 2 with scores of 9 there was
lower or negative PSCA protein expression (i.e. scores of
0-4), which may reflect posttranscriptional
modification of PSCA or obscuring of the epitopes recognized
by the PSCA mAb in some cancers. The data
demonstrate that the results of PSCA immunostaining
were consistent with those of mRNA ISH analysis, showing
a high degree of correlation between PSCA protein
and mRNA expression.
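
The overall concordance figure quoted above follows directly from the per-tissue counts. A minimal sketch of that arithmetic in Python is given here; the dictionary name and layout are illustrative assumptions, and only the counts themselves come from the text.

```python
# Per-tissue concordance between IHC and ISH as (concordant, total) pairs.
# The counts are those quoted in the text; the variable names are illustrative.
concordance = {"BPH": (18, 20), "PIN": (19, 20), "PCa": (42, 48)}

agree = sum(hits for hits, _ in concordance.values())
total = sum(n for _, n in concordance.values())
rate = 100 * agree / total

print(f"{agree} of {total} specimens concordant ({rate:.1f}%)")
```

The summed counts give 79 of 88 specimens, which matches the 89.8% stated in the paragraph.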
Figure 1

Representative PSCA IHC and ISH staining in PCa (A: IHC staining; B: ISH staining; x200 magnification). A1, B1: negative
controls for IHC and ISH. PBS replacing the primary antibody (A1) and hybridization with a sense PSCA probe (B1) showed no
background staining. A2, B2: a moderately differentiated PCa (Gleason score = 3+3 = 6) with moderate staining (composite score =
6) in all malignant cells; A2: IHC shows not only cell surface but also apparent cytoplasmic staining of PSCA protein. A3, B3: a
poorly differentiated PCa (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) in all malignant cells.
Discussion

PSCA is homologous to a group of cell surface proteins
that mark the earliest phase of hematopoietic development.
PSCA mRNA expression is prostate-specific in normal
male tissues and is highly up-regulated in both
androgen-dependent and -independent PCa xenografts
(LAPC-4 tumors). We hypothesize that PSCA may play a
role in PCa tumorigenesis and progression, and may serve
as a target for PCa diagnosis and treatment. In this study,
IHC and ISH showed that in general there was weak or
absent PSCA protein and mRNA expression in BPH and
low grade PIN tissues. However, PSCA protein and mRNA
are widely expressed in HGPIN, the putative precursor of
invasive PCa, suggesting that up-regulation of PSCA is an
early event in prostate carcinogenesis. Recently, Reiter RE
et al [1], using ISH analysis, reported that 97 of 118 (82%)
HGPIN specimens stained strongly positive for PSCA
mRNA. A very similar finding was reported for mouse PSCA
(mPSCA) expression in mouse HGPIN tissues by Tran CP
et al [8]. These data suggest that PSCA may be a new
marker associated with transformation of prostate cells
and tumorigenesis.

We observed that PSCA protein and mRNA are highly
expressed in a large percentage of human prostate cancers,
including advanced, poorly differentiated, androgen-independent
and metastatic cases. Fluorescence-activated
cell sorting and confocal/immunofluorescent studies
demonstrated cell surface expression of PSCA protein in
PCa cells [9]. Our IHC expression analysis of PSCA shows
not only cell surface but also apparent cytoplasmic staining
of PSCA protein in PCa specimens (Figure 1). One possible
explanation for this is that anti-PSCA antibody can
recognize PSCA peptide precursors that reside in the cytoplasm.
Also, it is possible that the positive staining that
appears in the cytoplasm is actually from the overlying
cell membrane [5]. These data seem to indicate that PSCA
is a novel cell surface marker for human PCa.

Our results show that an elevated level of PSCA expression
correlates with high grade (i.e. poor differentiation),
increased tumor stage and progression to androgen-independence
of PCa. These findings support the original IHC
analyses by Gu Z et al [9], who reported that PSCA protein
was expressed in 94% of primary PCa and that the intensity of
PSCA protein expression increased with tumor grade,
stage and progression to androgen-independence. Our
results also corroborate the recent work of Han KR et al
[10], in which a significant association was found between high
PSCA expression and adverse prognostic features such as
high Gleason score, seminal vesicle invasion and capsular
involvement in PCa. It is suggested that PSCA
overexpression may be an adverse predictor for recurrence,
clinical progression or survival of PCa. Hara H et al
[11] used RT-PCR detection of PSA, PSMA and PSCA in 1
ml of peripheral blood to evaluate PCa patients with poor
prognosis. The results showed that among 58 PCa
patients, each PCR indicated the prognostic value in the
hierarchy of PSCA > PSA > PSMA RT-PCR, and extraprostatic
cases with positive PSCA PCR showed lower disease-progression-free
survival than those with negative PSCA PCR,
demonstrating that PSCA can be used as a prognostic factor.
Dubey P et al [12] reported that elevated numbers of
PSCA+ cells correlate positively with the onset and development
of prostate carcinoma over a long time span in
the prostates of the TRAMP and PTEN+/- models compared
with normal prostates. Taken together with our
present findings, in which PSCA is overexpressed from
HGPIN to almost frank carcinoma, it is reasonable and
possible to use increased PSCA expression level or
increased numbers of PSCA-positive cells in the prostate
samples as a prognostic marker to predict the potential
onset of this cancer. These data raise the possibility that
PSCA may have diagnostic utility or clinical prognostic
value in human PCa.

The cause of PSCA overexpression in PCa is not known.
One possible mechanism is that it may result from PSCA
gene amplification. In humans, PSCA is located on chromosome
8q24.2 [1], which is often amplified in metastatic
and recurrent PCa and considered to indicate a poor
prognosis [13-15]. Interestingly, PSCA is in close proximity
to the c-myc oncogene, which is amplified in >20% of
recurrent and metastatic prostate cancers [16,17]. Reiter
RE et al [18] reported that PSCA and MYC gene copy numbers
were co-amplified in 25% of tumors (five out of
twenty), demonstrating that PSCA overexpression is associated
with PSCA and MYC coamplification in PCa. Gu Z
et al [9] recently reported that in 102 specimens available
to compare the results of PSCA immunostaining with
their previous mRNA ISH analysis, 92 (90.2%) had identically
positive areas of PSCA protein and mRNA expression.
Taken together with our findings, in which we
detected moderate to strong expression of PSCA protein
and mRNA in 34 of 40 (85%) PCa specimens examined
simultaneously by IHC and ISH analyses, these data
demonstrate that PSCA protein and mRNA are overexpressed in
human PCa, and that the increased protein level of PSCA
resulted from the upregulated transcription of its
mRNA.

At present, the regulation mechanisms of human PSCA
expression and its biological function are yet to be elucidated.
PSCA expression may be regulated by multiple factors
[18]. Watabe T et al [19] reported that transcriptional
control is a major component regulating PSCA expression
levels. In addition, induction of PSCA expression may be
regulated or mediated through cell-cell contact and protein
kinase C (PKC) [20]. Homologues of PSCA have
diverse activities, and have themselves been involved in
carcinogenesis. Signalling through SCA-2 has been demonstrated
to prevent apoptosis in immature thymocytes
[21]. Thy-1 is involved in T cell activation and transduces
signals through src-like tyrosine kinases [22]. Ly-6 genes
have been implicated both in tumorigenesis and in cell-cell
adhesion [23-25]. Cell-cell or cell-matrix interaction is
critical for local tumor growth and spread to distal sites.
From its restricted expression in basal cells of normal
prostate and its homology to SCA-2, PSCA may play a role
in stem/progenitor cell function, such as self-renewal (i.e.
anti-apoptosis) and/or proliferation [1]. Taken together
with the results in the present study, we speculate that
PSCA may play a role in tumorigenesis and clinical progression
of PCa through affecting cell transformation and
proliferation. From our results, it is also suggested that
PSCA as a new cell surface antigen may have a number of
potential uses in the diagnosis, therapy and clinical prognosis
of human PCa. PSCA overexpression in prostate
biopsies could be used to identify patients at high risk to
develop recurrent or metastatic disease, and to discriminate
cancers from normal glands in prostatectomy samples.
Similarly, the detection of PSCA-overexpressing cells
in bone marrow or peripheral blood may identify and predict
metastatic progression better than current assays,
which identify only PSA-positive or PSMA-positive prostate
cells.

In summary, we have shown in this study that PSCA protein
and mRNA are maintained in expression from
HGPIN through all stages of PCa in a majority of cases,
which may be associated with prostate carcinogenesis and
correlate positively with high tumor grade (poor cell
differentiation), advanced stage and androgen-independent
progression. PSCA protein overexpression is due to the
upregulation of its mRNA transcription. The results suggest
that PSCA may be a promising molecular marker for
the clinical prognosis of human PCa and a valuable target
for diagnosis and therapy of this tumor.
DECLARATION OF VICTORIA SMITH, Ph.D., UNDER 37 CFR § 1.132

Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 

Dear Sir: 

I, Dr. Victoria Smith, declare and state as follows: 

1. I am a Senior Scientist in the Department of Molecular Biology of Genentech, 
Inc., 1 DNA Way, South San Francisco, CA 94080. 

2. My scientific Curriculum Vitae, including my list of publications, is attached to
and forms part of this Declaration (Exhibit A). 

3. I joined Genentech in 1996. For approximately three years, I directed a laboratory 
in the Department of Molecular Biology. During this time I was involved in target discovery for 
the Tumor Antigen Project, using DNA microarrays to discover genes differentially expressed in 
tumors compared to their expression in normal tissues. In connection with the above-identified
patent application, I directed the generation and analysis of the microarray data attached as 
Exhibit B. 

4. Exhibit B reports the results of the microarray analysis conducted on the gene
encoding PRO539 (DNA47465) as part of the investigation of several newly discovered DNA
sequences. The column "Unq Id" identifies the gene as 340, which is DNA47465, while the
column "DNA Id" identifies the particular lot of PCR product used. The microarray experiments
were performed using well-established and accepted microarray techniques known in the art.
(See, e.g., Nature Revs. Genetics, 5:229-237 (2004), attached as Exhibit C). The DNA samples
used in the microarray studies were obtained from individual lung tumor tissue samples or
individual normal lung tissue samples. The individual tumor and normal lung samples were each
compared to pooled samples of normal epithelial tissue. The level of expression in the lung
tumor or normal lung tissue sample was compared to the normal pooled epithelial sample, and
reported as a raw ratio. The average of the normal lung samples was then used to normalize the
data to generate a ratio of expression of the PRO539 gene in lung tumor samples compared to the
average expression in normal lung tissue. In the results reported in Exhibit B, a ratio of 2.0 or
greater is a significant result, and indicates a significant increase in expression of the PRO539
gene in lung tumor tissue compared to the normal lung tissue controls.

5. The results of the microarray studies reported in Exhibit B indicate that the gene
encoding PRO539 (DNA47465) is significantly overexpressed in eight of the twenty-six lung
tumor samples tested compared to the normal lung tissue controls. That is the equivalent of
nearly one in every three samples (31%). In addition, four out of five squamous cell lung
carcinomas (80%) show significant overexpression (shown in bold). In contrast, only one of the
seven individual normal lung tissue samples shows significant overexpression of the PRO539
gene (14%).

6. It is well-established in the art that overexpression of the mRNA for a gene is
likely to lead to overexpression of the corresponding protein. Support for this statement can be
found, for example, in the Molecular Biology of the Cell, a leading textbook in the field. (Bruce
Alberts, et al., Molecular Biology of the Cell (4th ed. 2002), excerpts submitted herewith as
Exhibit D). Figure 6-3 on page 302 illustrates the basic principle that there is a correlation
between increased gene expression and increased protein expression. The accompanying text
states that "a cell can change (or regulate) the expression of each of its genes according to the
needs of the moment - most obviously by controlling the production of its mRNA." Molecular
Biology of the Cell at 302, emphasis added. Similarly, figure 6-90 on page 364 illustrates the
path from gene to protein. The accompanying text states that while potentially each step can be
regulated by the cell, "the initiation of transcription is the most common point for a cell to
regulate the expression of each of its genes." Molecular Biology of the Cell at 364. This point is
repeated on page 379, where the authors state that of all the possible points for regulating protein
expression, "[f]or most genes transcriptional controls are paramount." Molecular Biology of the
Cell at 379.



7. While not every lung tumor sample tested shows overexpression of the PRO539
gene, the data in Exhibit B indicate that a significant portion of lung tumors do (30% of all lung
tumors tested and 80% of squamous cell tumors tested), while only one of the normal lung tissue
samples shows overexpression (14%). Given the known correlation between overexpression of a
gene and the corresponding overexpression of the encoded protein, it is very likely that a similar
number of lung tumors will overexpress the PRO539 protein, while very few normal lung tissue
samples likely will. Together with the data reported in Example 16 that the gene encoding
PRO539 is amplified in some lung tumors, including squamous cell lung carcinoma, the results
reported in Exhibit B indicate that the PRO539 gene and protein, as well as antibodies to the
encoded protein, can be used to differentiate some cancerous lung tissue, particularly squamous
cell carcinoma, from normal lung tissue. Because not all lung tumors show overexpression of
PRO539, it cannot be used to exclude a sample being tested as non-cancerous. However, the
PRO539 gene, protein, and corresponding antibodies are useful as a diagnostic tool for lung
cancer, particularly squamous cell carcinoma, since a very high percentage of squamous cell lung
EXHIBIT A



VICTORIA SMITH 



Genentech Inc. 
Dept. Molecular Biology 
1 DNA Way
South San Francisco CA 94080
Ph: (650) 225 7382
Fax: (650) 225 6497
Email: victoria@gene.com 

Education 

Ph.D. (1991) Molecular Biology, University of Cambridge, Cambridge, United 
Kingdom. 

Honors (1987) Biochemistry, University of Western Australia, Australia. 
Bachelor of Science (1986) Physical and Inorganic Chemistry, and Biochemistry, 
University of Western Australia, Australia. 

Work and Research Experience

Senior Scientist, Genentech Inc (August 1996 - present; promoted to Senior Scientist 
March 2001) 

Lab head, Dept. Molecular Biology 

- Identification of potential therapeutic targets for cancer using novel microarray 
technology 

- Discovery and identification of novel secreted proteins 

- Development of cancer therapeutics

Stanford University, California, U.S.A. (February 1995 - August 1996) 
Research Fellow, Department of Biochemistry 

Research: Genomic functional analysis of chromosome V of Saccharomyces cerevisiae 

Stanford University, California, U.S.A. (January 1992 - January 1995) 
Postdoctoral Fellow, Department of Genetics. 
Research: Development of new methodology for whole genome functional analysis in
microorganisms using genomic sequence data and insertional mutagenesis.

Awards

Human Frontiers Science Program Organization Long Term Fellowship (accepted
4/01/93-1/31/95).

American Cancer Society (California Division) Fellowship (1993, declined). 

Cambridge University, United Kingdom (October 1988 - December 1991)
Research undertaken at the Medical Research Council Laboratory of Molecular Biology,
Cambridge, UK, for the degree of Doctor of Philosophy, Cambridge University.
Thesis: A Molecular Genetic Analysis of Yeast Chromosome IX.
Thesis Advisor: Dr. Barclay Barrell. 

Awards 

Max Perutz Prize in 1991 for outstanding performance by a graduate student. Awarded 
for advances in genomic-scale DNA sequencing methodology, and genetic analysis of 
the SNP1 gene of Saccharomyces cerevisiae.

King Edward Memorial Hospital for Women, Western Australia (1988) 
Research Scientist. 

Research: Analysis of hormone-inducible mRNAs in breast tumors. 
University of Western Australia (1984 - 1987) 

1984 - 1986: Bachelor of Science degree with double major in Biochemistry and Physical 
and Inorganic Chemistry. 

1987: First Class Honors in Biochemistry. Thesis: Nuclease Sensitivity and Methylation 
Patterns of the Phenylalanine Hydroxylase Gene. 

Awards 

Association of Commonwealth Universities Scholarship for study in the United 
Kingdom (accepted, October 1988 - October 1991, University of Cambridge). 
Hackett Scholarship for overseas study (1988, declined). 

Lady James Prize in Natural Science in 1986: Best student completing Bachelor of 
Science degree with a major in a natural science. 

JWH Lugg Prize in Biochemistry in 1986: Best student completing major in 
Biochemistry. 
Convocation Prize in Science in 1985: Best student completing major in second year
Physics, Geology or Chemistry. 

Shell Prize in Chemistry in 1985: Best student completing major in second year 
Chemistry. 



Publications 

^? hen ' D ' Wieand ' TR Uada * NA - Won & AM - ^118, S. Paterson- 
Brown, T.D. Wu, J.Z. Tang K.J. Hillan, I.D. Penman, Expression Analysis of the 

Metaplasia-Dysplasia-Carcinoma Sequence in Barrett's Esophagus (submitted). 

Tice DA. Szeto W. Soloviev I. Rubinfeld B. Fong SE. Dugger DL. Winer J. Williams PM. 
Wieand D. Smith V. Schwall RH. Pennica D. Polakis P. Journal of Biological Chemistry 
277(16):14329-35, 2002 Apr 19. 

N.J. Maughan, F. Lewis, V. Smith (2001) Journal of Pathology 195, 3-6.

F. Lewis, N.J. Maughan, V. Smith, K. Hillan, P. Quirke (2001) Journal of Pathology 195, 66- 

D.J. Garfinkel, M.J. Curcio and V. Smith (1998) Ty Mutagenesis. Methods in Microbiology,
volume 26, 101-117. 

C. Churcher, S. Bowman, K. Badcock, A. Bankier, D. Brown, T. Chillingworth, R. 
Connor, K. Devlin, S. Gentles, N. Hamlin, D. Harris, T. Horsnell, S. Hunt, K. Jagels, M. 
Jones, G. Lye, S. Moule, C. Odell, D. Pearson, M. Rajandream, P. Rice, N. Rowley, J. 
Skelton, V. Smith, S. Walsh, S. Whitehead & B. Barrell (1997) Nature, 387, 84-87.

F. S. Dietrich, J. Mulligan, K. Hennessy, M. A Yelton, E. Allen, R. Araujo, E. Aviles, A. 
Berno, T. Brennan, J. Carpenter, E. Chen, J. M. Cherry, E. Chung, M. Duncan, E. 
Guzman, G. Hartzell, S. Hunicke-Smith, R. W. Hyman, A. Kayser, C. Komp, D. 
Lashkari, H. Lew, D. Lin, D. Mosedale, K. Nakahara, A. Namath, R. Norgren, P. Oefner, 
C Oh, FX Petel, D. Roberts, P. Sehl, S. Schramm, T. Shogren, V. Smith, P. Taylor, Y. 
Wei, D. Botstein & R. W. Davis. (1997) Nature 387, 78-81.

V. Smith, K. Chou, D. Lashkari, D. Botstein, and P. O. Brown. (1996). Functional
Analysis of the Genes of Yeast Chromosome V by Genetic Footprinting. Science 274,
2069-74

V. Smith, D. Botstein, and P. O. Brown (1995) Genetic Footprinting: A genomic strategy
for determining a gene's function given its sequence. Proc. Natl. Acad. Sci. U.S.A., 92,
6479-6483.

V. Smith, M. Craxton, A. T. Bankier, C. M. Brown, W. D. Rawlinson, M. Chee, and B. G.
Barrell (1995) Microtiter methods for the preparation and fluorescent sequencing of M13
clones. Recombinant DNA Methodology II: a volume in the Selected Methods in Enzymology
Series, pp. 607 - 621.

V. Smith, M. Craxton, A. T. Bankier, C. M. Brown, W. D. Rawlinson, M. S. Chee, and B.
G. Barrell (1993) Microtiter methods for the preparation and fluorescent sequencing of
M13 clones. Methods in Enzymology, 218, 173-187.

V. Smith and M. S. Chee (1991) A simple method for sequencing the complementary
strand of ssDNA from M13 clones. Nucleic Acids Research 19, 6957.

V. Smith and B. G. Barrell (1991) Cloning of a Yeast U1 snRNP 70K protein homologue:
functional conservation of an RNA binding domain between humans and yeast. EMBO
Journal 10, 2627-2643.

W. D. Rawlinson, M. S. Chee, V. Smith and B. G. Barrell (1991) Preparation of large
numbers of single stranded DNA templates by rescue from phagemids in microtiter
plates. Nucleic Acids Research 19, 4779.

V. Smith, C. M. Brown, A. T. Bankier, and B. G. Barrell (1990) Semi-automated 
preparation of DNA templates for large scale sequencing projects. DNA Sequence 1, 73- 
78. 

S. J. Wysocki, E. Hahnel, S. P. Wilkinson, V. Smith, and R. Hahnel (1990) Hormone- 
sensitive gene expression in breast tumors. Anticancer Research 10, 185-188 






S. J. Wysocki, E. Hahnel, A. Masters, V. Smith, A. J. McCartney and R. Hahnel (1990)
Detection of pS2 messenger RNA in gynecological cancers. Cancer Research 50, 1800- 
1802 



Patents 



Genetic Footprinting: Insertional Mutagenesis and Genetic Selection. U.S. Patent No. 
5,612,180. Inventors: Patrick Brown and Victoria Smith 

PATENTS FILED (at Genentech Inc.)

Methods of Detecting and Quantifying Gene Expression. Inventors: Victoria Smith, 
Edward Robbie, David Lowe, James Marsters. 

Compositions and Methods for the Treatment of Cancer. Inventors: Victoria Smith, 
Austin Gurney, Audrey Goddard, Fred DeSauvage 

Diagnostic for Dysplasia in Barrett's Esophagus. Inventor: Victoria Smith 



Numerous composition of matter filings related to novel gene discovery, pending 



EXHIBIT B 



Raw Ratio = sample/pooled epithelial; Normalized Ratio = sample/normal lung

Lung Tumor Samples

Unq Id  DNA Id   Experiment Name                     Raw Ratio   Normalized Ratio
340     208,278  lung tumor R1057 vs epi pool        3.429       2.463
340     208,278  lung tumor R1094 vs epi pool        1.716       1.232
340     208,278  lung tumor R1372 vs epi pool        0.046       0.033
340     208,278  lung tumor R1417 vs epi pool        1.106       0.794
340     208,278  lung tumor R1431 vs epi pool        0.242       0.174
340     208,278  lung tumor R419 vs epi pool         1.413       1.015
340     208,278  lung tumor R542 vs epi pool         0.429       0.308
340     208,278  lung tumor R544 vs epi pool         0.735       0.528
340     208,278  lung tumor R685 vs epi pool         0.247       0.177
340     208,278  lung tumor R688 vs epi pool         0.897       0.644
340     208,278  lung tumor R693 vs epi pool         0.755       0.542
340     208,278  lung tumor R737 vs epi pool         1.508       1.083
340     208,278  lung tumor R742 vs epi pool         6.313       0.225
340     295,413  lung tumor                          3.213       2.307
340     208,278  lung tumor R543 vs epi pool         0.402       0.289
340     208,278  lung tumor 1604/REF.RNA             1.06        0.761
340     208,278  lung tumor 1683                     11.523      8.275
340     208,278  lung tumor 1853                     0.067       0.048
340     208,278  lung tumor 2454/ref.RNA             11.028      7.920
340     208,278  LungCaSq-hf1602                     1.103       0.792
340     208,278  LungCaSq-hf1647                     2.794       2.007
340     208,278  LungSqCa-hf1293                     4.728       3.396
340     208,278  LungSqCa-hf1646                     5.399       3.877
340     208,278  LungSqCa-hf?                        6.306       4.529
340     208,278  100ngLungBaCa1069 vs 25ngEpi1409    0.376       0.270
340     208,278  100ngLungBaCa994 vs 25ngEpi1409     1.706       1.225

Normal Lung Samples

Unq Id  DNA Id   Experiment Name                     Raw Ratio   Normalized Ratio
340     208,278  N. lung R1415 vs univ.ref           0.433       0.311
340     208,278  N. lung R1431 vs ref.               0.567       0.407
340     208,278  N. lung R417 vs univ. ref           1.372       0.985
340     208,278  1054LungInflamTA2                   1.585       1.138
340     208,278  1415LungInflamTA2                   1.408       1.011
340     208,278  423LungInflamTA2                    3.709       2.664
340     208,278  488-552InflmdLungTA2                0.673       0.483
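
The normalization and the 2.0 threshold described in paragraph 4 of the Declaration can be illustrated with a short sketch. This is not the analysis pipeline used to produce Exhibit B. It is a minimal Python illustration under stated assumptions: the function and variable names are hypothetical, the five samples whose names contain "Sq" are assumed to be the squamous cell carcinomas, and the normal lung raw ratio 3.709 is read from a poorly legible entry.

```python
from statistics import mean

# Raw ratios (sample / pooled epithelial) for the seven normal lung samples.
# Assumption: 3.709 is a reading of a poorly legible entry in the exhibit.
NORMAL_RAW = [0.433, 0.567, 1.372, 1.585, 1.408, 3.709, 0.673]

# Normalized ratios (sample / average normal lung) as listed in Exhibit B.
TUMOR_NORMALIZED = [
    2.463, 1.232, 0.033, 0.794, 0.174, 1.015, 0.308, 0.528, 0.177,
    0.644, 0.542, 1.083, 0.225, 2.307, 0.289, 0.761, 8.275, 0.048,
    7.920, 0.792, 2.007, 3.396, 3.877, 4.529, 0.270, 1.225,
]
# Assumption: samples labeled "LungCaSq"/"LungSqCa" are the squamous tumors.
SQUAMOUS_NORMALIZED = [0.792, 2.007, 3.396, 3.877, 4.529]
NORMAL_NORMALIZED = [0.311, 0.407, 0.985, 1.138, 1.011, 2.664, 0.483]

THRESHOLD = 2.0  # a ratio of 2.0 or greater is treated as significant


def normalize(raw_ratio, normal_raw=NORMAL_RAW):
    """Divide a raw ratio by the average raw ratio of the normal lung samples."""
    return raw_ratio / mean(normal_raw)


def count_overexpressed(ratios, threshold=THRESHOLD):
    """Count the samples whose normalized ratio meets the threshold."""
    return sum(1 for r in ratios if r >= threshold)


groups = [
    ("all lung tumors", TUMOR_NORMALIZED),
    ("squamous cell tumors", SQUAMOUS_NORMALIZED),
    ("normal lung", NORMAL_NORMALIZED),
]
for label, ratios in groups:
    hits = count_overexpressed(ratios)
    print(f"{label}: {hits}/{len(ratios)} ({100 * hits / len(ratios):.0f}%)")
```

Applied to the listed values, the threshold gives 8 of 26 lung tumors, 4 of 5 squamous cell tumors and 1 of 7 normal lung samples, in agreement with paragraph 5 of the Declaration. Calling `normalize(3.429)` returns approximately 2.463, the normalized ratio listed for the first tumor sample.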